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5 IMMUNOREACTIVE HEPATITIS C VIRUS POLYPEPTIDE COMPOSITIONS 

Technical Field 

This invention relates generally to 
immunoreactive polypeptide compositions, methods of using 
10 the compositions in immunological applications, and 
materials and methods for making the compositions. 

Background 

The hepatitis C virus has been recently 

15 identified as the major causative agent of post- 
transfusion Non-A, Non-B hepatitis (NANHB) , as well as a 
significant cause of community- acquired NANBH. 
Materials and methods for obtaining the viral genomic 
sequences are known. See, e.g. PCT Publication Nos . 

20 WO89/04669, WO90/11089 & W090/14436. 

Molecular characterization of the HCV genome 
indicates that it is a RNA molecule of positive polarity 
containing approximately 10,000 nucleotides that encodes 
a polyprotein of about 3 011 amino acids. Several lines 

25 of evidence suggest that HCV has a similar genetic 

organization to the viruses of the family Flaviviridae , 
which includes the flavi- and pestivirus. Like its 
pesti- and flaviviral relatives, HCV appears to encode a 
large polyprotein precursor from which individual viral 

30 proteins (both structural and non- structural ) are 
processed. 

RNA- containing viruses can have relatively 

high rates of spontaneous mutation, i.e., reportedly on 

- 3 - 4 

the order of 10 to 10 per incorporated nucleotide. 
35 Therefore, since heterogeneity and fluidity of genotype 
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are common in RNA viruses, there may.be multiple viral 
isolates, which may be virulent or avirulent, within the 
HCV species . 

A number of 'different isolates of HCV have now 
5 been identified. The sequences of these isolates 

demonstrate the limited heterogeneity characteristic of 
RNA viruses . 

Isolate HCV Jl . 1 is described in Kubo, Y. et 
al. (1989), Japan. Nucl . Acids Res. 17:10367-10372; 
10 Takeuchi, K. et al.(1990). Gene 91:287-291; Takeuchi et 
al. (1990), J. Gen. Virol. 71:3027-3033; Takeuchi et al . 

(1990) , Nucl. Acids Res. 18:4626. 

The complete coding sequences plus the 5'- and 
3' -terminal sequences of two independent isolates, 
15 "HCV- J" and "BK", are described by Kato et al . and 

Takamizawa et al, respectively. (Kato et al . (1990), 
Proc. Natl. Acad. Sci. USA 87:9524-9528; Takamizawa et al 

(1991) , J. Virol. 65:1105-1113.) 

Other publications describing HCV isolates are 
20 the following; 

"HCV-l": Choo et al (1990), Brit. Med. 
Bull. 46:423-441; Choo et al . (1991), Proc. 
Natl. Acad, Sci. USA 88:2451-2455; Han et al - 
(1991), Proc. Natl. Acad. Sci. USA 88:1711- 
25 1715; European Patent Publication No. 318,216. 

"HC-Jl" and "HC-J4": Okamoto et al . 
(1991), Japan J. Exp. Med. 60:167-177. 

"HCT 18", "HCT 23", "Th" , "HCT 27", "ECl" 
and "EClO": Weiner et al . (1991), Virol. 
30 180 : 842-848 . 

"Pt-1", "HCV-Kl" and "HCV-K2": Enomoto et 
al. There are two major types of hepatitis C 
virus in Japan. Division of Gastroenterology, 
Department of Internal Medicine, Kanazawa 
35 Medical University, Japan. 
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Clones "A", "C", "D" & "E" : Tsukiyama- 
Kohara ec al . , A second group of hepatitis 
virus, in Virus Genes , 

5 A typical approach to diagnostic and vaccine 

strategy is to focus on conserved viral domains. This 
approach, however, suffers from the disadvantage of 
ignoring important epitopes that may lie in variable 
domains . 

10 It is an object of this invention to provide 

polypeptide compositions that are immunologically cross- 
reactive with multiple HCV isolates, particularly with 
respect to heterogeneous domains of the virus. 

15 Summary of the Invention 

It has been discovered that a number of 
important HCV epitopes vary among viral isolates, and 
that these epitopes can be mapped to particular domains. 
This discovery allows for a strategy of producing 
20 immunologically cross -reactive polypeptide compositions 

that focuses on variable (rather than conserved) domains. 

Accordingly, one embodiment of the present 
invention is an immunoreactive composition comprising 
polypeptides wherein the polypeptides comprise the amino 
25 acid sequence of an epitope within a first variable 

domain of HCV, and at least two heterogeneous amino acid 
sequences from the first variable domain of distinct HCV 
isolates are present in the composition. 

Another embodiment of the invention is an 
30 immunoreactive composition comprising a plurality of 

antigen sets, wherein (a) each antigen set consists of a 
plurality of substantially identical polypeptides 
comprising the amino acid sequence of an epitope within a 
first variable domain of an HCV isolate, and (b) the 
35' amino acid sequence of the epitope of one set is 
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heterogeneous with respect to the amino acid sequence of 
the analogous sequence of at least one other set. 

Another embodiment of the invention is 
an immunoreactive composition comprising a plurality of 
5 polypeptides wherein each polypeptide has the formula 

Rr- (SVJ,-R'^. 

wherein 

R and R' are amino acid sequences of about 
1-2000 amino acids, and are the same or different; 

^ and r' are 0 or l, and are the same or 

different; 

V is an amino acid sequence comprising the 
sequence of an HCV variable domain, wherein the variable 
domain comprises at least one epitope; 

S in an integer > l, representing a selected 
variable domain; and 

n is an integer > l, representing a selected 
HCV isolate heterogeneous at a given SV with respect to 
at least one other isolate having a different value for 
n, and n being independently selected for each x; 

X is an integer > i; and 
with the proviso that amino acid sequences are present in 
the composition representing a combination selected from 
the group consisting of (i) iv, and iVj, (ii) iVj and 2V2, 
25 and (iii) IVj and 2Vi , 

Yet another embodiment of the invention is a 
method for preparing an immunogenic pharmaceutical 
composition HCV comprising: 

(a) providing an immunoreactive composition as 
3 0 described above; 

(b) providing a suitable excipient; and 

(c) mixing the immunoreactive composition of 
(a) with the excipient of (b) in a proportion that 
provides an immunogenic response upon administration to a 

35 mammal . 



20 
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Still another embodiment of the invention is a 
method for producing anti-HCV antibodies comprising 
administering to a mammal an effective amount of ".n 
immunoreactive composition as described above* 
5 Yet another embodiment of the invention is a 

method of detecting antibodies to HCV within a biological 
sample comprising: 

(a) providing a biological sample suspected of 
containing antibodies to HCV; 
10 (b) providing an immunoreactive composition 

described above; 

(c) reacting the biological sample of (a) with 
the immunoreactive composition of (b) under conditions 
which allow the formation of antigen-antibody complexes; 

15 and 

(d) detecting the formation of antigen- 
antibody complexes formed between the immunoreactive 
composition of (a) and the antibodies of the biological 
sample of (b) , if any. 

20 Another embodiment of the invention is a kit 

for detecting antibodies to HCV within a biological 
Sctmple comprising an immunoreactive composition as 
described above packaged in a suitable container. 

25 Brief Description of the Figures 

Figure 1 schematically shows the genetic 
organization of the HCV genome. 

Figure 2 shows a comparison of the deduced 
amino acid sequences of the El protein encoded by group I 
30 and group IX HCV isolates. 

Figure 3 shows a comparison of the amino acid 
sequences of the putative E2/NS1 region of HCV isolates. 

Figure 4 are graphs showing the antigenicity 
profiles for the amino- terminal region of the putative 

35 
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HCV E2/NS1 protein (amino acids 384-420), and the gp 120 
V3 hypervariable region of HIV-l. 

Figure 5 shows a series of graphs which give 
the percentage probabilities that a given residue from 
5 the amino- terminal region of HCV E2/NS1 protein (amino 
acids 384 to 420) will be found in either alpha-helix, 
beta-sheet or beta- turn secondary structural motif. 

Figure 6 are bar graphs showing the reactivity 
of antibodies in the plasma from HCV 18 (panels A-C) or 
10 Th (Panels D-f) with overlapping biotinylated 8mer 

peptides derived from. amino acids 3 84 to 415 or 416 of 
HCV isolates HCT 18 (A,D), Th (B,E) and HCV Jl (C,F), 
respectively. 

Figure 7 shows the deduced amino acid sequences 
15 of two regions of the E2/NS1 polypeptide, amino acids 
384-414 and 547-647, given for the Ql and Q3 isolates. 

Figure 8A shows the deduced amino acid 
sequences of isolates HCV Ji.i and Ji.2 from amino acids 
384 to 647. Figure 8B shows the deduced amino acid 
20 sequences of isolates HCT27 and HCVEl from amino acids 
384 to 651. 

Figure 9 shows the entire polyprotein sequence 
of isolate HCV-l. 

25 Modes of Practicing the Invention 

The practice of the present invention will 
employ, unless otherwise indicated, conventional 
techniques of molecular biology, microbiology, 
recombinant DNA, and immunology, which are within the 

30 skill of the art. Such techniques are explained fully in 
the literature. See e.g., Maniatis, Fitsch & Sambrook, 
MOLECULAR CLONING; A LABORATORY MANUAL (2nd ed. 1989); 
DNA CLONING, VOLUMES I AND II (D.N Glover ed. 1985); 
OLIGONUCLEOTIDE SYNTHESIS (M.J. Gait ed, 1984) ; NUCLEIC 

35 . ACID HYBRIDIZATION (B.D. Hames & S.J. Higgins eds . 1984); 



1 
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TRANSCRIPTION AND TRANSLATION (B.D. Hames & S.J. Higgins 
eds, 1984); ANIMAL CELL CULTURE (R.I. Freshney ed. 1986); 
IMMOBILIZED CELLS AND ENZYMES ( IRL Press, 19 86) ; B. 
Perbal, A PRACTICAL GUIDE TO MOLECULAR CLONING (19 84); 
5 the series, METHODS IN ENZYMOLOGY (Academic Press, Inc.); 
GENE TRANSFER VECTORS FOR MAMMALIAN CELLS - (J.H. Miller 
and M.P. Calos eds. 1987, Cold Spring Harbor Laboratory), 
Methods in Enzymology Vol. 154 and Vol. 155 (Wu and 
Grossman, and Wu, eds., respectively) , Mayer and Walker, 
10 eds. (19 87), IMMUNOCHEMICAL METHODS IN- CELL AND MOLECULAR 
BIOLOGY (Academic Press, London), Scopes, (1987), PROTEIN 
PURIFICATION: PRINCIPLES AND PRACTICE, Second Edition 
(Springer-Verlag, N.Y.), and HANDBOOK OF EXPERIMENTAL IM- 
MUNOLOGY, VOLUMES I-IV (D.M. Weir and C. C. Blackwell eds 
15 1986) ; IMMUNOASSAY: A PRACTICAL GUIDE (D.W. Chan ed. 

1987) • All patents, patent applications, and publica- 
tions mentioned herein, both above and below, are 
incorporated by reference herein. 

HCV is a new member of the Family Flaviviridae 
20 which includes the pestiviruses (Hog Cholera Virus and 
Bovine Viral Diarrhea Virus) and the Flaviviruses , 
examples of which are Dengue and Yellow Fever Virus. A 
scheme of the genetic organization of HCV is shown in 
Figure 1. Similar to the flavi- and pestiviruses , HCV 
25 appears to encode a basic polypeptide domain ("C") at the 
N- terminus of the viral polyprotein followed by two 
glycoprotein domains ("El", "E2/NS1"), upstream of the 
nonstructural genes NS2 through NS5. The amino acid 
coordinates of the putative protein domains are shown in 
30 Table 1. 



wo 93/06126 



-8- 



PCT/LS92/07683 



10 



Table 1. The Putative Protein Domains in HCV 

a. a. coordinates (approximate) Protein 

1 - 191 C 

192 - 383 El 

384 - 750 E2/NS1 

751 - 1006 NS2 

1007 - 1488 NS3 

1489 - 1959 NS4 

1960 - 3011 NS5 



As discussed above, a number of HCV isolates 
have been identified. Comparative sequence analysis of 
complete and partial HCV sequences indicates that based 
upon homology at the nucleotide and amino acid levels, 

15 HCV isolates can be broadly sub-divided into at least 
three basic groups (Table 2). See Houghton et al . , 
(1991) Hepatology 14:381-388. However, only partial 
sequence is available for the isolates in group III. 
Therefore, when the sequences of these isolates are more 

20 defined, one or more of these isolates may deserve 

separation into a different group, including a potential 
fourth group. Table 3 shows the sequence homologies 
between individual viral proteins of different HCV 
isolates as deduced from their nucleotide sequences. It 

25 can be seen that the proteins of the same virus group 
exhibit greater sec[uence similarity than the same 
proteins encoded by different virus groups (Table 3) . 
One exception to this is the nucleocapsid protein that is 
highly conserved among all group I and II viral isolates 

3 0 sequences to date. (In Table 3, the symbol N/A signifies 
that the sequences were not available for comparison.) 
For purposes of the present invention, therefore, group I 
isolates can be defined as those isolates having their 
viral proteins, particularly El and E2/NS1 proteins, 

35 about 9 0% homologous or more at the amino acid level to 
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the isolates classified as group I herein. Group II is 
defined in an analogous manner. Future groups can 
likewise be defined in terms of viral protein homology to 
a prototype isolate. Subgroups can also be defined by 
homology in limited proteins, such as the El, E2/NS1 or 
NS2 proteins, or by simply higher levels of homology. 



10 



15 



Table 2 



Classification of hepatitis C viral 



aenome RNA 


sequences into 


three basic arouDS 


HCV I 


HCV II 


HCV III 


HCV-1 


HCV-Jl.l 


Clones A,C,D&E 


HC-Jl 


HC- J4 


HCV-K2 (a&b) 


HCT 18 


HCV-J 




HCT 23 


BK 




Th 


HCV-Kl 




HCT 27 






ECl 






Pt-1 







20 



25 



Table 3. Amino Acid Homologies (%) Between Viral 

Proteins Encoded by Different HCV Isolates 



HCV £ 
Group 

I compared to 
I 98-100 

II 97-98 

III N/A 



El E2/NS1 NS2 



NS3 



NS4 



NS5 



94-100 N/A N/A N/A N/A 99-100 
77-79 78-81 75-77 91-92 90-93 84-88 
N/A N/A N/A 86 76-80 71-74 



30 II compared to 

II 98-100 92-100 89-100 93-100 94-100 97-100 95-100 
III N/A N/A N/A N/A 84 76 74-75 



35 



III compared to 

III N/A N/A N/A N/A N/A 91-100 89-100 
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Ic is noteworthy that the putative viral 
envelope proteins encoded by the El and E2/NS1 genes show 
substantial amino acid sequence variation between groups 
5 I and II. Only NS2 exhibits a greater degree of 

heterogeneity, while the C, NS3, NS4 and NS5 proteins all 
show greater sequence conservation between groups . The 
sequence variation observed in the putative virion 
envelope proteins between groups I and II reflects a 

10 characteristic segregation of amino acids between the two 
groups. An example of this is shown in Figure 2 where 
the sequence of the El gene product is compared between 
viruses of groups I and II. The El amino acid sequences 
deduced from nucleotide sequences of HCV groups II and II 

15 are shown. In the figure, the horizontal bars indicate 
sequence identity with HCV-l, The asterisks indicate 
group- specif ic segregation of amino acids; the group- 
specific residues can be clearly identified. Group I 
sequences are HCV-l, HCT18, HCT23, HCT27, and HC-Jl. 

20 Group II sequences are HC-J4, HCV- J, HCV Jl.l, and BK. 
Such group- specif ic segregation of amino acids is also 
present in other gene products including gp72- encoded by 
the E2/NS1 gene. Figure 3 shows the comparative amino 
acid sequence of the putative E2/NS1 region of HCV 

25 isolates which segregate as group I and group II. The 

latter protein also contains an N- terminal hypervariable 
region ("HV") of about 30 amino acids that shows large 
variation between nearly all isolates. See Weiner et al . 
(1991) , supra. This region occurs between amino acids 

30 384 to 414, using the amino acid numbering system of 
HCV-l . 

The putative HCV envelope glycoprotein E2/NS1 
may correspond to the gp53 (BVDV) /gp5 5 (Hog Cholera Virus) 
envelope polypeptide of the pestiviruses and the NSl of 

35 




wo 93/06126 



-11- 



PCr/US92/07683 



the f laviviruses , both of which confer protective 
immunity in hosts vaccinated with these polypeptides. 

Striking similarities between the 
hypervariable region ("HV") and HIV-i gpl20 V3 domains 
5 with respect to degree of sequence variation, the 
predictive effect of amino acid changes on putative 
antibody binding in addition to the lack of defined 
secondary structure suggest that the HV domain encodes 
neutralizing antibodies. 

10 The immunogenicity of the domain is shown by 

antibody epitope mapping experiments, described in the 
Examples. The results of these studies suggest that in 
addition to the three major groups of HCV, HV specific 
sub-groups also exist. 

15 Analysis of biological samples from individuals 

with HCV induced NANBH indicate that individuals may be 
carrying two or more HCV variants simultaneously. Two 
CO- existing HV variants were found in the plasma of one 
individual, Jl . In addition, partial sequencing of the 

20 gene of an individual with chronic NANBH, who had 

intermittent flares of hepatitis, revealed that the 
individual, Q, was infected with two HCV variants (Ql or 
Q3) . Each variant was associated with only one episode 
of the disease. An ELISA using a Ql or Q3 specific 

25 peptide (amino acids 396-407) showed that Q developed an 
antibody response to the Ql peptide but not the 
corresponding Q3 peptide, suggesting that Q's 
recrudescence of disease was due to the appearance of an 
HV variant. The presence of antibodies to the Ql peptide 

3 0. but lack of humoral immune response to the Q3 peptide 
during the second episode of disease suggest that 
variation in the HV domain may result from the pressure 
of immune selection. Amino acids 396-407 appear to be 
subject to the greatest selective pressure in the HV 

35 domain. These findings support the thesis that high 
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levels of chronicity associated with .the disease might be 
due to an inadequate immunological host response to HCV 
infection and/or effective viral mechanisms of 
immunological evasion. Moreover, they point to the 
5 E2/NS1 HV region as a genetic region involved in a viral 
escape mechanism and/or an inadequate immunological 
response mechanism (s) . 

As discussed above, there are several variant 
regions within the HCV genome. One or more of these 

10 regions are most likely involved in a viral escape 

mechanism and/or an inadequate immunological response 
mechanism. Therefore, it is desirable to include in 
. compositions for treatment of HCV polypeptides which 
would induce an immunogenic response to these variants . 

15 In that the El and E2/NS1 regions of the genome 

encode putative envelope type polypeptides, these regions 
would be of particular interest with respect to 
immunogenicity . Thus, these regions are amongst those to 
which it would be particularly desirable to induce and/or 

20 increase an immune response to protect an individual 

against HCV infection, and to aid in the prevention of 
chronic recurrence of the disease in infected, 
individuals. In addition, these regions would be amongst 
those from which it would be desirable to detect HCV 

25 variants which are arising during the course of 

infection, as well as super- or co- infection by two or 
more variants . 

The present invention describes compositions 
and methods for treating individuals to prevent HCV 

30 infections, and particularly chronic HCV infections. In 
addition, it describes compositions and methods for 
detecting the presence of anti-HCV antibodies in 
biological samples. This latter method is particularly 
useful in identifying anti-HCV antibodies generated in 

35 response, to immunologically distinct HCV epitopes. This 
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method can also be used to study the evolution of 
multiple variants of HCV within an infected individual. 
In the discussion of the invention, the following 
definitions are applicable. 
5 The term "polypeptide" refers to a polymer of 

amino, acids and does not refer to a specific length of 
the product; thus, peptides, oligopeptides, and proteins 
are included within the definition of polypeptide. This 
term also does not refer to or exclude post -expression 

10 modifications of the polypeptide, for example, 

glycosylations , acetylations, phosphorylations and the 
like. Included within the definition are, for example, 
polypeptides containing one or more analogues of an amino 
acid (including, for example, unnatural amino acids, 

15 etc.), polypeptides with siabstituted linkages, as well as 
other modifications known in the art, both naturally 
occurring and non- naturally occurring. 

As used herein, A is "substantially isolated" 
from B when the weight of A is at least about 70%, more 

20 preferably at least about 80%, and most preferably at 

least about 90% of the combined weights of A and B. The 
polypeptide compositions of the present invention are 
preferably substantially free of hximan or other primate 
tissue (including blood, serum, cell lysate, cell 

25 organelles, cellular proteins, etc.) and cell culture 
medium. 

A "recombinant polynucleotide" intends a 
polynucleotide of genomic, cDNA, semisynthetic, or 
synthetic origin which, by virtue of its origin or 
30 manipulation: (1) is not associated with all or a portion 
of a polynucleotide with which it is associated in 
nature, (2) is linked to a polynucleotide other than that 
to which it is linked in nature, or (3) does not occur in 
nature . 



35 
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A "polynucleotide" is a polymeric form of 
nucleotides of any length, either ribonucleotides or 
deoxyribonucleotides . This term refers only to the 
primary structure of the molecule. Thus, this term 
5 includes double- and single -stranded DNA and RNA. It 

also includes known types of modifications, for example, 
labels which are known in the art, methylation, "caps", 
substitution of one or more of the naturally occurring 
nucleotides with an analog, internucleotide modifications 
such as, for example, those with uncharged linkages 
(e.g., phosphorothioates, phosphorodithioates , etc.), 
those containing pendant moieties, such as, for example 
proteins (including for e.g., nucleases, toxins, 
antibodies, signal peptides, poly-L- lysine , etc.), those 
15 with intercalators (e.g., acridine, psoralen, etc.), 
those containing chelators (e.g. , metals, radioactive 
metals, etc.), those containing alkylators , those with 
modified linkages (e.g., alpha anomeric nucleic acids, 
etc.), as well as unmodified forms of the polynucleotide. 
20 "Recombinant host cells", "host cells", 

"cells", "cell lines", "cell cultures", and other such 
terms denoting microorganisms or higher eukaryotic cell 
lines cultured as unicellular entities refer to cells 
which can be or have been, used as recipients for a 
25 recombinant vector or other transfer polynucleotide, and 
include the progeny of the original cell which has been 
transfected. It is understood that the progeny of a 
single parental cell may not necessarily be completely 
identical in morphology or in genomic or total DNA 
complement as the original parent, due to natural, 
accidental, or deliberate mutation. 

A "replicon" is any genetic element, e.g., a 
plasmid, a chromosome, a virus, a cosmid, etc., that 
behaves as an autonomous unit of polynucleotide 



30 



35 



wo 93/06126 



-15- 



PCT/US92/07683 



replication within a cell; i.e., capable of replication 
under its own control . 

A "vector" is a replicon further comprising 
sequences providing replication and/or expression of the 
5 open reading frame. 

"Control sequence" refers to polynucleotide 
sequences which are necessary to effect the expression of 
coding sequences to which they are ligated. The nature 
of such control sequences differs depending upon the host 

10 organism; in prokaryotes , such control sequences 

generally include promoter, ribosomal binding site, and 
terminators; in eukaryotes, generally, such control 
secjuences include promoters, terminators and, in some 
instances, enhancers. The term "control sequences" is 

15 intended to include, at a minimum, all components whose 
pr-esence is necessary for expression, and may also 
include additional components whose presence is 
advantageous, for example, leader sequences which govern 
secretion. 

20 A "promoter" is a nucleotide sequence which is 

comprised of consensus sequences which allow the binding 
of RNA polymerase to the DNA template in a manner such > 
that mRNA production initiates at the normal 
transcription initiation site for the adjacent structural 

25 gene. 

"Operably linked" refers to a juxtaposition 
wherein the components so described are in a relationship 
permitting them to function in their intended manner. A 
control sequence "operably linked" to a coding sequence 
3 0 is ligated in such a way that expression of the coding 

sequence is achieved under' conditions compatible with the 
control sequences . 

An "open reading frame" (ORF) is a region of a 
polynucleotide sequence which encodes a polypeptide; this 
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region may represent a portion of a coding sequence or a 
total coding sequence. 

A "coding sequence" is a polynucleotide 
sequence which is transcribed into mRNA and/or translated 
5 into a polypeptide when placed under the control of 

appropriate regulatory sequences. The boundaries of the 
coding sequence are determined by a translation start 
codon at the 5' -terminus and a translation stop codon at 
the 3 terminus . A coding sequence can include but is 

10 not limited to mRNA, DNA (including cDNA) , and 
recombinant polynucleotide sequences. 

As used herein, "epitope" or "antigenic 
determinant means an amino acid sequence that- is 
immunoreactive. Generally an epitope consists of at 

15 least 3 to 5 amino acids, and more usually, consists of 
at least about 8, or even about 10 amino acids. As used 
herein, an epitope of a designated polypeptide denotes 
epitopes with the same amino acid sequence as the epitope 
in the designated polypeptide, and immunologic 

20 equivalents thereof. 

An "antigen" is a polypeptide containing one or 
more epitopes. 

"Immunogenic" means the ability to elicit a 
cellular and/or humoral immune response. An immunogenic 

25 response may be elicited by immunoreactive polypeptides 
alone, or may require the presence of a carrier in the 
presence or absence of an adjuvant. 

"Immunoreactive" refers to (1) the abiility to 
bind immunologically to an antibody and/or to a 

30 lymphocyte antigen receptor or (2) the ability to be 
immunogenic . 

An "antibody" is any immunoglobulin, including 
antibodies and fragments thereof, that binds a specific 
epitope. The term encompasses, inter alia , polyclonal, 

35 monoclonal, and chimeric antibodies. Examples of 
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chimeric antibodies are discussed in U.S. Patent Nos . 
4,816,397 and 4,816,567. 

An "antigen set" is defined as a composition 
consisting of a plurality of substantially identical 
5 polypeptides, wherein the polypeptides are comprised of 
an amino acid sequence of one defined epitope . 

"Substantially identical polypeptides" means 
polypeptides that are identical with the exception of 
variation limited to the typical range of sequence or 

10 size variation attributable to the polypeptide's method 
of production; e.g., recombinant expression, chemical 
synthesis, tissue culture, etc. This variation does- not 
alter the desired functional property of a composition of 
substantially identical polypeptides; e.g., the 

15 composition behaves immunologically as a composition of 
identical polypeptides v The variations may be due to, 
for example, alterations resulting from the secretory 
process during transport of the polypeptide, less than 
100% efficiency in chemical synthesis, etc. 

20 As used herein, a "variable domain" or "VD" of 

a viral protein is a domain that demonstrates a 
consistent pattern of amino acid variation between at 
least two HCV isolates or sxibpopulations . Preferably, 
the domain contains at least one epitope. Variable 

25 domains can vary from isolate to isolate by as little as 
1 amino acid change. These isolates can be from the same 
or different HCV group(s) or subgroup(s). Variable 
domains can be readily identified through sequence 
composition among isolates, and examples of these 

30 techniques are described below. For the purposes of 

describing the present invention, variable domains will 
be defined with respect to the amino acid number of the 
polyprotein encoded by the genome of HCV-1 as shown in 
Figure 9, with the initiator methionine being designated 

35 position 1. The corresponding variable domain in another 
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HCV isolate is determined by aligning the two isolates 
sequences in a manner the brings the conserved domains 
outside any variable domain into maximum alignment. This 
can be performed with any of a number of computer 
5 software packages, such as ALIGN 1.0, available from the 
University of Virginia, Department of Biochemistry (Attn: 
Dr. William R, Pearson). See Pearson et al . , (1988) 
Proc. Natl. Acad. Sci. USA 85:2444-2448. It is to be 
understood that the amino acid numbers given for a 

10 particular variable domain are somewhat subjective and a 
matter of choice. Thus, the beginning and end of 
variable domains should be understood to be approximate 
and to include overlapping domains or subdomains, unless 
otherwise indicated. 

15 An epitope is the "immunologic equivalent" of 

another epitope in a designated polypeptide when it 
cross -reacts with antibodies which bind immunologically 
to the epitope in the designated polypeptide. 

Epitopes typically are mapped to comprise at 

20 least about five amino acids, sometimes at least about 8- 
amino acids, and even about 10 or more amino acids. 

The amino acid sequence comprising the HCV 
epitope may be linked to another polypeptide (e.g., a 
carrier protein) , either by covalent attachment or by 

25 expressing a fused polynucleotide to form a fusion 

protein. If desired, one may insert or attach multiple 
repeats of the epitope, and/or incorporate a variety of 
epitopes. The carrier protein may be derived from any 
source, but will generally be a relatively large, 

30 immunogenic protein such as BSA, KLH, or the like. If 
desired, one may employ a substantially full-length HCV 
protein as the carrier, multiplying the number of 
immunogenic epitopes. Alternatively, the amino acid 
sequence from the HCV epitope may be linked at the amino 

35 terminus and/or carboxy terminus to a non-HCV amino acid 



wo 93/06126 



-19- 



PCT/US92/07683 



sequence, thus the polypeptide would be a "fusion 
polypeptide". Analogous types of polypeptides may be 
constructed using epitopes from other designated viral 
proteins. 

5 A "variant" of a designated polypeptide refers 

to a polypeptide in which the amino acid sec[uence of the 
designated polypeptide has been altered by the deletion, 
substitution, addition or rearrangement of one or more 
amino acids in the sequence. Methods by which variants 
10 occur (for example, by recombination) or are -made (for 
example, by site directed mutagenesis) are known in the 
art . 

"Transformation" refers to the insertion of an 
exogenous polynucleotide into a host cell, irrespective 
15 of the method used for the insertion, for example, direct 
uptake, transduction (including viral infection) , f- 
mating or electroporation. The exogenous polynucleotide 
may be maintained as a non- integrated vector, for 
example, a plasmid or viral genome, or alternatively, may 
20 be integrated into the host genome. 

An "individual" refers to a vertebrate, 
particularly a member of a mammalian species, and 
includes but is not limited to rodents (e.g.", mice, rats, 
hamsters, guinea pigs) , rabbits, goats, pigs, cattle, 
25 sheep, and primates (e.g., chimpanzees, African Green 
Monkeys, baboons, orangutans, and humans). 

As used herein, "treatment" refers to any of 
(i) the prevention of infection or reinfection, as in a 
traditional vaccine, (ii) the reduction or elimination of 
30 symptoms, and (iii) the substantial or complete 

elimination of the virus. Treatment may be effected 
prophylactically (prior to infection) or therapeutically 
(following infection) . 

The term "effective amount" refers to an amount 
35 of epitope-bearing polypeptide sufficient to induce an 
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iinmunogenic response in the individual to which it is 
administered, or to otherwise detectably immunoreact: in 
its intended system (e.g., immunoassay). Preferably, the 
effective amount is sufficient to effect treatment, as 
5 defined above. The exact amount necessary will vary from 
application. For vaccine applications or in the 
generation of polyclonal antiserum/ antibodies , for 
example, the effective amount may vary depending on the 
species, age, and general condition of the individual, 

10 the severity of the condition being treated, the 
particular polypeptide selected and its mode of 
administration, etc. It is also believed that effective 
amounts will be found within a relatively large, non- 
critical range. An appropriate effective amount can be 

15 readily determined using only routine experimentation. 

As used herein, a "biological sample" refers to 
a sample of tissue or fluid isolated from an individual, 
including but not limited to, for example, plasma, seriim, - 
spinal fluid, lymph fluid, the external sections of the 

20 skin, respiratory, intestinal, and genitourinary tracts, 
tears, saliva, milk, blood cells, tumors, organs, 
biopsies and also samples of in vitro cell culture 
constituents (including but not limited to conditioned 
medium resulting from the growth of cells in cell culture 

25 medium, e.g., Mab producing myeloma cells, recombinant 
cells, and cell components) . 

The immunoreact ive polypeptide compositions of 
the present invention comprise a mixture of isolate- or 
group- specif ic epitopes from at least one HCV VD. Thus, 

3 0 there will be present at least two heterogeneous amino 

acid sequences each defining an epitope found in distinct. 
HCV isolates located in the saime or substantially same 
physical location in an HCV protein; i.e. each sequence' 
maps to the same location within the HCV 

35 genome/polypeptide . Since the sequences are 
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heterogeneous, the location is referred to as a variable 
domain (VD) . 

To better understand the invention, first the 
individual aunino acid sequences that make up the 
5 compositions of the invention will be explained. Then 
the plurality of such sequences which are found in the 
compositions of the present invention will be discussed. 

The amino acid sequence that characterizes the 
polypeptides of the present invention have a basic 
10 structure as follows: 

Ly-Z-L'y. (I) 
Z represents the amino acid sequence from a region of a 
protein from. a selected HCV isolate, where the region 
comprises at least one variable domain and the variable 
15 domain comprises at least one epitope. L and L' are 

non-HCV amino acid sequences or HCV amino acid sec[uences 
that do not contain a variable domain, and which can be 
the same or different. y and y' are 0 or l and can be 
the same or different. Thus, formula I represents an 
20 amino acid sequence comprising the sequence of an HCV VD, 
wherein the VD comprises an epitope. 

As discussed above, the epitope (s) in . Z will 
usually comprise a minimum of about 5 amino acids, more 
typically a minimum of about 8 amino acids, and even more 
25 typically a minimum of about 10 amino acids. 

The variable domain of Z can comprise more than 
one epitope. The variable domain of Z is at least as big 
as the combined sequences of the epitopes present, thus 
making it typically a minimum of about 5 amino acids when 
30 a single epitope is present. Since epitopes can overlap, 
the minimum amino acid sequence for combined epitopes in 
the variable domain may be less than the sum of the 
individual epitopes' sequences. 

Z is the amino acid sequence of an HCV isolate 
35 comprising the above -described VD. Thus, the minimum 
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sxze of Z is the minimum size of the VD. Z can comprise 
more HCV amino acid sequence than just the VD, and can 
further comprise more than one VD. The maximum size of Z 
IS not critical, but obviously cannot exceed the length 
Of the entire HCV polyprotein. Typically, however, Z 
will be the sequence of an entire HCV protein 
(particularly El, E2/NS1, NS2 , NS3 , NS4 and NS5) or even 
more typically, a fragment of such an HCV protein Thus 
Z will preferably range from a minimum of about 5 amino ' 
acids (more preferably about 8 or about 10 amino acids 
minimum) to a maximum of about iioo amino acids (more 
preferably a maximum of about 500, more preferably a 
maximum of about 400 or even more preferably a maximum of 
about 200 amino acids maximum) . More usually, the 
polypeptide of formula I and/or Z, when prepared by, 
e.g., chemical synthesis, is a maximum of about 50 amino 
acids, more typically a maximum of about 40 amino acids, 
and even more typically a maximum of about 3 0 amino 
acids . 

The non-HCV amino acid sequences, L and L' , if 
present, can constitute any of a number types of such • 
sequences. For example, L and L' can represent non-HCV 
sequences to which Z is fused to facilitate recombinant 
expression (e.g., beta-galactosidase, superoxide 
dismutase, invertase, alpha- factor, TPA leader, etc.), as 
discussed below. Alternatively, l and L' can represent 
epitopes of other pathogens, such as hepatitis B virus, 
Bordeizella pertussis, tetanus toxoid, diphtheria, etc.! 
to provide compositions that are immunoreactive relative 
to a number these other pathogens. L and L' can be amino 
acid sequences that facilitate attachment to solid 
supports during peptide synthesis, immunoassay supports, 
vaccine carrier proteins, etc. In fact, L and L' can 
even comprise one or more superfluous amino acids with no 
functional advantage. There is no critical maximum size 
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for L or L' , the length being generally governed by the 
desired function. Typically, L and L' will each be a 
maximum of about 2000 cimino acids, more typically a 
maximum of about 1000 amino acids. The majority of L and 
5 L' sequences with useful properties will be a majximum of 
about 500 amino acids. It is desirable, of course, to 
select L and L' so as to not block the immunoreactivity 
of z. 

The composition of polypeptides provided 
10 according to the present invention are characterized by 
the presence (in an effective amount for 

immunoreactivity) within the composition of at least two 
amino acid sequences defined as follows by formulas II 
and III , respectively: 

15 Ly-Zi-L'y. (II) 

Ly-Z2-L'y. (Ill) 
L, L' , y and y' are defined as above, as well as 
independently defined for each of formulas II and III. 
Zj and Z2 are each HCV amino acid sequences as defined for 

20 z above encompassing the same variable domain (i.e., 
physical location) , but derived from different HCV 
isolates having between them at least one heterogeneous 
epitope in the common variable domain of Zj' and Zj. As an 
illustrative example, an amino acid sequence according to 

25 formula II could have as Zi a fragment the hypervariable 
domain spanning amino acids 3 84-414 of isolate HCV-l (or 
more particularly 396-407 or 396-408), while Z^ is the 
analogous fragment from isolate HCV-Jl.l. These two 
isolates are heterogeneous in this domain, the amino acid 

3 0 sequences of the epitopes varying significantly. 

It is to be understood that the compositions of 
the present invention may comprise more than just two 
discrete amino acid secpaences according to formula I, and 
that the Z sequences may be divided into groups 

35 encompassing different variable domains. For example, a 
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composition according to the present invention could 
comprise a group of HCV sequences (with amino acid 
sequences according to formula I) encompassing the 
hypervariable domain at amino acids 384-411 from isolates 
5 HCV-l, HCV-Ji.i, HC-Jl, HC-J4, etc. The composition 

could also comprise an additional group of HCV sequences 
(within amino acid sequences according to formula I) 
encompassing the variable domain at amino acids 215-255 
also from isolates HCV-i, HCV-Ji.i, HC-Ji, HC-J4, etc. 
10 Within the context of the compositions of the present 
invention, therefore, the sequence of formula I can be 
further defined as follows: 

SV„ (IV) 
V represents an amino acid sequence comprising the 
15 sequence of an HCV variable domain, wherein the variable 
domain comprises at least one epitope; i.e., formula I. 
S and n are integers of 1 or greater. S represents a 
particular variable domain, and n represents a particular 
isolate. For example, S=l could represent the variable 
20 domain at amino acids 384-411; s=2 could represent the 

variable domain at amino acids 215-255; and n=l, 2, 3 and 
4 could represent isolates HCV-l, HCV-Jl.i, HC-Jl and HC-' 
J4, respectively. Thus, the two groups of sequences 
discussed aibove could be represented by: 
2^ Group 1: IV,, IV2, IV3 & IV4 

Group 2: 2V,, 2V2, 2V3 & 2V4 
There are at least two distinct sequences of 
formula IV in the compositions according to the present 
invention; i.e., the composition contains two different 
sequences according to formula IV where the values for S 
and or n are different. For example, at least IV, and IV2 
are present, or at least IV, and 2V2 are present, or at 
least IV, and 2V, are present. 

The distinct sequences falling within formula 
IV are present in the composition either on the same or 
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different polypeptide molecules. Using the miniiniim 
combination of IVj and IV^ to illustrate, these two 
sequences could be present in the same polypeptide 
molecule (e.g., IVt-lVj) or in separate molecules. This 
5 feature of the compositions of the present invention can 
be described as compositions of polypeptides as follows: 

R,- (SVJ,-R',. (^) 
wherein S, V and n are as defined above; R and R' are 
amino acid sequences of about 1-2000 amino acids, and are 
10 the same or different; r and r' are 0 or 1, and are the 
same or different; x is an integer > 1; n is 
independently selected for each x; and with the proviso 
that cimino acid sequences are present in the composition 
representing a combination selected from the group 
15 consisting of (i) IV^ and IV^, (ii) IV^ and 2V2, and (iii) 
IVi and 2Vi. In embodiments where the distinct sequences 
of formula IV are in different polypeptides, x can be 1, 
although it can still be >1 if desired; e.g., a mixture 
of polypeptides IV^-IV^ and IV1-2V2. When x is 1, r and 
20 r' are preferably both 0 to avoid redundancy with Ly and 
L'y., since V can be described by in a preferred 
embodiment by formula I. When x is >1, the combined 
lengths of R and the adjacent L, and of R' " and the 
adjacent L' , are preferably no more than the typical 
25 maximiim lengths described above for L and L' . 

The selection of the HCV amino acid sequences 
included within the distinct V sequences of the 
compositions will depend upon the intended application o: 
the sequences and is within the skill of the art in view 
30 of the present disclosure. First, it should be 

appreciated that the HCV epitopes of concern to the 
present invention can be broken down into two types. Th 
first type of epitopes are those that are "group- 
specific"; i.e., the corresponding epitopes in all or 



35 



wo 93/06126 



-26- 



PCr/US92/07683 



10 



substantially all isolates within an HCV isolate group 
are immunologically cross - reactive with each other, but 
not with the corresponding epitopes of substantially all 
the isolates of another group. Preferably, the epitopes 
in a group- specif ic class are substantially conserved 
within. the group, but not between or among the groups. 
The second type of epitopes are those that are "isolate- 
specific"; i.e., the epitope is immunologically cross- 
reactive with substantially identical isolates, and is 
not cross -reactive with all or substantially all distinct 
isolates . 

These group- and isolate- specif ic epitopes can 
be readily identified in view of the present disclosure. 
First, the sequences of several HCV isolates is compared, 
15 as described herein, and areas of sequence heterogeneity 
identified. The pattern of heterogeneity usually 
indicates group or isolate specificity. If an identified 
area is known to comprise one or more epitopes, then a 
sequence of sufficient size to include the desired 
20 epitope (s) is selected to as an variable domain that may 
be included in the compositions of the present invention. 
If the immunoreactivity of a given heterogeneous area. is 
not known, peptides representing the sequences found in 
that area of the various HCV isolates can be prepared and 
25 screened. Screening can include, but is not limited too, 
immunoassays with various sources of anti-HCV antibody 
(e.g., patient serum, neutralizing Mabs, etc.) or 
generation of antibody and testing the ability of such 
antibody to neutralize virus in vitro. Alternatively, 
the loci of epitopes identified in a screening protocol, 
such as that described below, can be examined for 
heterogeneity among various isolates and the 
immunological properties of corresponding heterogeneous 
sequences screened. 
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For vaccine applications, it is believed that 
variable domains from the El and/or E2/NS1 domains will 
be of particular interest. In particular, an El variable 
domain within amino acids 215-255 (see Figure 2) , and an 
E2/NS1 variable domain within simino acids 384-414 (see 
Figure. 3) , have been identified as being important 
immunoreactive domains. The preliminary evidence 
suggests that one or both of these domains may be loci of 
heterogeneity responsible for escape mutants, leading to 
chronic HCV infections. Thus, polypeptide compositions 
as described aJDOve where the varicible domain (s) in V are 
one or both of these variable domains are particularly 
preferred. Furthermore, the polypeptide compositions of 
the present invention, while particularly concerned with 
15 the generally linear epitopes in the variable domains, 
may also include conformational epitopes. For example, 
the composition can be comprised of a mixture of 
recombinant El and/or E2/NS1 proteins (exhibiting the 
variable domains of different isolates) expressed in a 
20 recombinant system (e.g., insect or mammalian cells) that 
maintains conformational epitopes either inside or 
outside the variable domain. Alternatively, an El and/or 
E2/NS1 subunit antigen from a single isolate that 
maintains conformational epitopes can be combined with a 
25 polypeptide composition according to the present 

invention (e.g., a mixture of synthetic polypeptides or 
denatured recombinant polypeptides) . In another 
preferred application for vaccines, the polypeptide 
compositions described herein are combined with other HCV 
30 subunit antigens, such as those described in commonly 

owned U.S. S.N. . entitled "Hepatitis C Virus 

Asialoglycoproteins" (Attorney Docket No. 0154.002) by 
Robert O. Ralston, Frank Marcus, Kent B. Thudium, Barbara 
Gervase, and John Hall, filed on even date herewith, and 
3 5 incorporated herein by reference. 



wo 93/06.26 _ - PCT/US92/07683 

-28- 



For diagnostic application, it may be useful to 
employ the compositions of the present invention as 
antigens, thereby improving the ability to detect 
antibody to distinct HCV isolates. Typically the 
5 polypeptide mixtures can used directly in a homogeneous 
or heterogeneous immunoassay format, the latter 
preferably comprising immobilizing the polypeptide on a 
solid substrate (e.g., microtiter plate wells, plastic 
beads, nitrocellulose, etc.). See , e.g.. PCT Pub. No. 

10 WO90/110a9; EPO Pub. No. 360,088; IMMUNOASSAY: A 
PRACTICAL GUIDE, supra. Alternatively, each 
substantially identical polypeptide that makes up the 
polypeptide composition of the present invention could be 
immobilized on the same support at discrete loci, thereby 

15 providing information as to which isolate or group the 
antibody has been generated. This may be particularly 
important in diagnostics if various isolates cause 
hepatitis, cancer or other diseases with different 
clinical prognoses. A preferred format is the Chiron 

20 RIBA™ strip immunoassay format, described in commonly 
owned U.S. S.N, 07/138,894 and U.S. S.N. 07/456,637, the 
disclosures of which are incorporated herein by 
reference. 

Polypeptides useful in the manufacture of the 
25 compositions of the present invention can be made 
recombinantly , synthetically or in tissue culture. 
Recombinant polypeptides comprised of the truncated HCV 
sequences or full-length HCV proteins can be made up 
entirely of HCV sequences (one or more epitopes, either 
30 contiguous or noncontiguous) , or sequences in a fusion 
protein. In fusion proteins, useful heterologous 
sequences include sequences that provide for secretion 
from a recombinant host, enhance the immunological 
reactivity of the HCV epitope (s), or facilitate the 
35 coupling of the polypeptide to a support or a vaccine 
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carrier. See, e.g., EPO Pub. No. 116,201; U.S. Pat. No. 
4,722840; EPO Pub. No. 259,149; U.S. Pat. No. 4,629,783, 
the disclosures of which are incorporated herein by 
reference. 

5 Full length as well as polypeptides comprised 

of truncated HCV seq[uences, and mutants thereof, may be 
prepared by chemical synthesis. Methods of preparing 
polypeptides by chemical synthesis are Icnown in the art. 
They may also be prepared by recombinant technology. A 
10 DNA sequence encoding HCV-1, as well as DNA sequences of 
variable regions from other HCV isolates have been 
described and/or referenced herein. The availability of 
these sequences permits the construction of 
polynucleotides encoding immune reactive regions of HCV 

15 polypeptides. 

Polynucleotides encoding the desired 
polypeptide comprised of one or more of the 
immunoreactive HCV epitope from a variable domain of HCV 
may be chemically synthesized or isolated, and inserted 
20 into an expression vector. The vectors may or may not 
contain portions of fusion sequences such as beta- 
Galactosidase or superoxide dismutase (SOD) . Methods and 
vectors which are useful for the production of 
polypeptides which contain fusion sequences of SOD are 
25 described in European Patent Office Publication number 
0196056, pxiblished October 1, 1986. 

The DNA encoding the desired polypeptide, 
whether in fused or mature form and whether or not 
containing a. signal sequence to permit secretion, may be 
30 ligated into expression vectors suitable for any 

convenient host. The hosts are then transformed with the 
expression vector. Both eukaryotic and prokaryotic host . 
systems are presently used in forming recombinant 
polypeptides, and a summary of some of the more common 
35 control systems and host cell lines is presented infra. 
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The host cells are incubated under conditions which allow 
expression of the desired polypeptide. The polypeptide 
is then isolated from lysed cells or from the culture 
medium and purified to the extent needed for its intended 
5 use. 

The general techniques used in extracting the 
HCV genome from a virus, preparing and probing DNA 
libraries, sequencing clones, constructing expression 
vectors, transforming cells, performing immunological 

10 assays such as radioimmunoassays and ELISA assays, for 

growing cells in culture, and the like, are known in the 
art. (See, e.g., the references cited in the "Background- 
section, above, as well as the. references cited at the 
beginning of this ("Modes of Practicing the Invention"_ 

15 section above. 

Transformation of the vector containing the 
desired sequence into the appropriate host may be by any 
known method for introducing polynucleotides into a host 
cell, including, for example, packaging the 

20 polynucleotide in a virus and transducing the host cell 
with the virus, or by direct uptake of the 
polynucleotide. The transformation procedure . used 
depends upon the host to be transformed. Bacterial 
transformation by direct uptake generally employs 

25 treatment with calcium or rxibidium chloride (Cohen 
(1972), Proc. Natl. Acad. Sci. USA 61:2110. Yeast 
transformation by direct uptake may be carried out using 
the method of Hinnen et al . (1978), J. Adv. Enzyme 
Reg. 7: 1929. Mammalian transformations by direct uptake 

30 may be conducted using the calcium phosphate 

precipitation method of Graham and Van der Eb (1978) , 
Virology 52:546, or the various Icnown modifications 
thereof. Other methods for the introduction of 
recombinant polynucleotides into cells, particularly into 

35 mammalian cells, which are known in the art include 
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dextran mediated transf ection, calcium phosphate mediated 
transf ection, polybrene mediated transf ection, protoplast 
fusion, electroporation, encapsulation of the 
polynucleotide (s) in liposomes, and direct microinjection 
5 of the polynucleotides into nuclei. 

In order to obtain expression of desired coding 
sequences, host cells are transformed with 
polynucleotides (which may be expression vectors) , which 
are comprised of control sequences operably linked to the 
10 desired coding sequences. The control sequences are 

compatible with the designated host. Among prokaryotic 
hosts, E. coli is most frequently used. Expression 
control sequences for prokaryotes include promoters, 
optionally containing operator portions, and ribosome 
15 binding sites. Transfer vectors compatible with 

prokaryotic hosts are commonly derived from, for example, 
pBR322, a plasmid containing operons conferring 
ampicillin and tetracycline resistance, and the various 
pUC vectors, which also contain sequences conferring 
20 antibiotic resistance markers. Promoter sequences may be 
naturally occurring, for example, the S- lactamase 
(penicillinase) (Weissman (1981), "The cloning of » 
interferon and other mistakes" in Interferon 3 (ed. I. 
Gresser) , lactose (lac) (Chang et al . (1977), Nature 
25 198:1056) and tryptophan (trp) (Goeddel et al . (1980), 

Nucl. Acids Res. 8.:4057), and lambda -derived Pl promoter 
system and N gene ribosome binding site (Shimatake et al . 
(1981) , Nature 292 :128) . In addition, synthetic 
promoters which do not occur in nature also function as 
30 bacterial promoters. For example, transcription 

activation sequences of one promoter may be joined with 
the operon sequences of another promoter, creating a 
synthetic hybrid promoter (e.g., the tac promoter, which 
is derived from sequences of the trp and lac promoters 
35 (De Boer et al . (1983), Proc. Natl. Acad. Sci. USA 
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M:2l) . The foregoing systems are particularly 
compatible with E. coli; if desired, other prokaryotic 
hosts such as strains of Bacillus or Pseudomonas may be 
used, with corresponding control sequences. 

Eukaryotic hosts include yeast and mammalian 
cells in culture systems. Sacch^^nmyn^<. cerevisiao and 
SaccharomYces carlsbergensi s are the most commonly used 
yeast hosts, and are convenient fungal hosts. Yeast 
compatible vectors generally carry markers which permit 
selection of successful transf ormants by conferring 
prototropy to auxotrophic mutants or resistance to heavy 
metals on wild- type strains. Yeast compatible vectors 
may employ the 2 micron origin of replication (Broach et 
al. (1983), Meth. Enz. 1^1:307), the combination of CEN3 
15 and ARSl or other means for assuring replication, such as 
sequences which will result in incorporation of an 
appropriate fragment into the host cell genome. Control 
sequences for yeast vectors are known in the art and 
include promoters for the synthesis of glycolytic enzymes 
20 (Hess et al . (1968), J. Adv. Enzyme Reg. 7:149); for 

example, alcohol dehydrogenase (ADH) (E.P.O. Pxiblication 
No. 284044), enolase, glucokinase, glucose- 6 -phosphate 
isomerase, glyceraldehyde- 3 -phosphate dehydrogenase (GAP 
or GAPDH) , hexokinase, phosphof ructokinase , 3- 
25 glycerophosphate mutase, and pyruvate kinase (PyK) (E.P.O. 
Publication No. 329203). The yeast PH05 gene, encoding 
acid phosphatase, also provides useful promoter 
sequences. In addition, synthetic promoters which do not 
occur in nature also function as yeast promoters. For 
30 example, upstream activating sequences (UAS) of one yeast 
promoter may be joined with the transcription activation 
region of another yeast promoter, creating a synthetic 
hybrid promoter. Examples of such hybrid promoters 
include the ADH regulatory sequence linked to the GAP 
35 transcription activation region (U.S. Patent Nos . 
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4,876,197 and 4,880,734). Other examples of hybrid 
promoters include promoters which consist of the 
regulatory sequences of either the ADH2 , GAL4, GALIO, or 
PH05 genes, combined with the transcriptional activation 
5 region of a glycolytic enzyme gene such as GAP or PyK 
(E.P.O. P\iblication No. 164556) , Furthermore, a yeast 
promoter can include naturally occurring promoters of 
non-yeast origin that have the ability to bind yeast RNA 
polymerase for the appropriate initiation of 
10 transcription. 

Other control elements which may be included in 
the yeast expression vector are terminators (e.g., from 
GAPDH, and from the enolase gene (Holland (1981), J. 
Biol. Chem. 256 : 1385) , and leader sequences. The leader 
15 sequence fragment typically encodes a signal peptide 
comprised of hydrophobic amino acids which direct the 
secretion of the protein from the cell. DNA encoding 
suitable signal sequences can be derived from genes for 
secreted yeast proteins, such as the yeast invertase gene 
20 (E.P.O- Publication No. 12,873) and the a- factor gene 

(U.S. Patent No. 4,588,684). Alternatively, leaders of 
non-yeast origin, such as an interferon leader, also 
provide for secretion in yeast (E.P.O. Publication No. 
60057) . A preferred class of secretion leaders are those 
25 that employ a fragment of the yeast or- factor gene, which 
contains both a "pre" signal sequence, and a "pro" 
region. The types of o;- factor fragments that can be 
employed include the full-length pre -pro a- factor leader, 
as well as truncated a- factor leaders (U.S. Patent Nos . 
30 4,546,083 and 4,870,008; E.P.O. Publication No. 324274. 

Additional leaders employing an o;- factor leader fragment 
that provides for secretion include hybrid a- factor 
leaders= made with a pre- sequence of a first yeast, but a 
pro- region from a second yeast of-factor. (See, e.g., 
35 P.C.T. WO 89/02463) . 
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Expression vectors, either extrachromosomal 
replicons or integrating vectors, have been developed for 
transformation into many yeasts. For example, expression 
vectors have been developed for Candida albicans (Kurtz 
et al. (1986), Mol . Cell Biol. 6:142), Candida maltsga 
(Kunze et al . (1985) J. Basic Microbiol. 25:141), 
Hanzenula polvmorpha (Gleeson et al . (1986), J. Gen. 
Microbiol. 132:3459), KluwPT-omY^oo fraaili^ (Das et al . 
(1984), J. Bacterid. il8:ll65), IQiiiSiaromicces lactis (De 
Louvencourt et al . (1983), J. Bacteriol . 154:737), Pichia 
quillerimondii , (Kunze et al . (1985), supra), Pichia 
pastoris (Cregg et al . (1985), Mol. Cell. Biol. 5:3376; 
U.S. Patent Nos . 4,837,148 and 4,929,555)), 
SchizosaccharomYces pombe (Beach and Nurse (1981), Nature 
3M:706), and Yarrowia lipolvrir^ (Davidow et al . (1985), 
Curr. Genet. i0:39) . 

Mammalian cell lines available as hosts for 
expression are known in the art and include many 
imrr.-rtalized cell lines available from the American Type 
Culture Collection (ATCC) , including, for example, HeLa 
cells, Chinese hamster ovary (CHO) cells, baby hamster > 
kidney (BHK) cells, COS monkey cells, and a number of 
other cell lines. Suitable promoters for mammalian cells 
are also known in the art and include viral promoters 
25 such as that from. Simian Virus 40 (SV40) . Rous sarcoma 

virus (RSV) , adenovirus (ADV) and bovine papilloma virus 
(BPV) (See, Sambrook (19 89) for examples of suitable 
promoters) . Mammalian cells may also require terminator 
sequences and poly A addition sequences; enhancer 
sequences which increase expression may also be included, 
and sequences which cause amplification of the gene may 
also be desirable. These sequences are known in the art. 

Vectors suitable for replication in mammalian 
cells are known in the art, and may include viral 
replicons, or sequences which ensure integration of the 
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appropriate secpjences encoding the desired polypeptides 
into the host genome. 

A vector which is used to express foreign DNA 
and which may be used in vaccine preparation is Vaccinia 
5 virus. In this case, the heterologous DNA is inserted 
into the Vaccinia genome. Techniques for the insertion 
of foreign DNA into the vaccinia virus genome are known 
in the art, and utilize, for example, homologous 
recombination. The insertion of the heterologous DNA is 

10 generally into a gene which is non-essential in nature, 
for example, the thymidine kinase gene (tk) , which also 
provides a selectable marker. Plasmid vectors that 
greatly facilitate the construction of recombinant 
viruses have been described {see, for example, Mackett et 

15 al. (1984) in "DNA Cloning", Vol. II. IRL Press, p. 191, 
Chakrabarti et al . (1985), Mol . Cell Biol. 5:3403; Moss 
(1987) in "Gene Transfer Vectors for Mammalian Cells" 
(Miller and Calos, eds . , p. 10). Expression of the 
desired polypeptides comprised of immunoreactive regions 

20 then occurs in cells or individuals which are infected 
and/or immunized with the live recombinant vaccinia 
virus . 

Other systems for expression of polypeptides 
include insect cells and vectors suitable for use in 

25 these cells. These systems are known in the art, and 

include, for example, insect expression transfer vectors 
derived from the baculovirus Autoarapha calif ornica 
nuclear polyhedrosis virus (AcNPV) , which is a helper- 
independent, viral expression vector. Expression vectors 

3 0 derived from this system usually use the strong viral 
polyhedron gene promoter to drive expression of 
heterologous genes. Currently the most commonly used 
transfer vector for introducing foreign genes into AcNPV 
is pAc373. Many other vectors, known to those of skill 

35 in the art; have also been designed for improved 
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expression. These include, for example, pVL985 (which 
alters the polyhedron start codon from ATG to ATT, and 
which introduces a BamHI cloning site 32 basepairs 
downstream from the ATT; See Luckow and Summers (1989), 
Virology 17:31. Good expression of nonfused foreign 
proteins usually requires foreign genes that ideally have 
a short leader sequence containing suitable translation 
initiation signals preceding an ATG start signal. The 
plasmid also contains the polyhedron polyadenylation 
signal and the ampicillin-resistance (asr^) gene and 
origin of replication for selection and propagation in E. 
coli . 

Methods for the introduction of heterologous 
DNA into the- desired site in the baculovirus are known in 

15 the art. (See Summers and Smith, Texas Agricultural 

Experiment Station Bulletin No. 1555; Ju et al . (1987), 
in "Gene Transfer Vectors for Mammalian Cells (Miller and 
Calos, eds.); Smith et al . (19 83), Mol . & Cell. Biol. 
3.:2156; and Luckow and Summers (1989) , supra) . For 

20 example, the insertion can be into a gene such as the 

polyhedron gene, by homologous recombination; insertion- 
can also be into a restriction enzyme site engineered 
into the desired baculovirus gene. The inserted 
sequences may be those which encode all or varying 

25 segments of the desired HCV polypeptides including at 
least one epitope from a variable domain. 

The signals for posttranslational 
modifications, such as signal peptide cleavage, 
proteolytic cleavage, and phosphorylation, appear to be 

30 recognized by insect cells. The signals required for 
secretion and nuclear accumulation also appear to be 
conserved between the invertebrate and vertebrate cells . 
Examples of the signal sequences from vertebrate cells 
which are effective in invertebrate cells are known in 

35 the art, for example, the human interleukin 2 signal 
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(IL2J which is a signal for transport out if the cell, 
is recognized and properly removed in insect cells. 

It is often desirable that the polypeptides 
prepared using the above host cells and vectors be fusion 
5 polypeptides. As with non- fusion polypeptides, fusion 
polypeptides may remain intracellular after expression. 
Alternatively, fusion proteins can also be secreted from 
the cell into the growth medium if they are comprised of 
a leader sequence fragment. Preferably, there are 

10 processing sites between the leader fragment and the 

remainder of the foreign gene that can be cleaved either 
in vivo or in vitro . 

In cases where the composition is to be used 
for treatment of HCV, it is desirable that the 

15 composition be immunogenic. In instances wherein the 

synthesized polypeptide is correctly configured so as to 
provide the correct epitope, but is too small to be 
immunogenic, the polypeptide may be linked to a suitable 
carrier. A number of techniques for obtaining such 

20 linkage are known in the art, including the formation of 
disulfide linkages using N-succinimidyl-3 - (2-pyridyl- 
thio) propionate (SPDP) and succinimidyl 4-(N- 
maleimidomethyl) cyclohexane-l-carboxylate CSMCC) (if the 
peptide lacks a sulfhydryl group, this can be provided by 

25 addition of a cysteine residue,) These reagents create a 
disulfide linkage between themselves and peptide cysteine 
resides on one protein and an amide linkage through the 
e- amino on a lysine, or other free amino group in other 
amino acids. A variety of such disulfide/amide- forming 

3 0 agents are known. See, for example, Immun. Rev. (19 82) 
62:185. Other bifunctional coupling agents for a 
thioether rather than a disulfide linkage. Many of these 
thio- ether- forming agents are commercially available and 
include reactive esters of 6 -maleimidocaproic acid, 2- 

35 bromoacetic acid, 2-iodoacetic acid, 4 - (N-maleimido- 
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methyl ) cyclohexane- 1- carboxylic acid, and the like. The 
carboxyl groups can be activated by combining them with 
succinimide or 1 -hydroxyl- 2 -nitro- 4 -sulfonic acid, sodium 
salt. Additional methods of coupling antigens employ the 
5 rotavirus/ "binding peptide" system described in EPO 

Publication No. 259,149. The foregoing list is not meant 
to be exhaustive, and modifications of the named 
compounds can clearly be used. 

Any carrier may be used which does not itself 

10 induce the production of antibodies harmful to the host. 

Suitable carriers are typically large, slowly metabolized 
macromolecules such as proteins; polysaccharides such as 
latex f unctionalized sepharose, agarose, cellulose, 
cellulose beads and the like; polymeric amino acids, such 

15 as polyglutamic acid, polylysine, and the like; amino 
acid copolymers; and inactive virus particles (see 
infra.). Especially useful protein substrates are seriim 
albumins, keyhole limpet hemocyanin, immunoglobulin 
molecules, thyroglobulin, ovalbumin, tetanus toxoid, and 

20 other proteins well known to those of skill in the art. 

The immunogenicity of the epitopes of the HCV , 
variable domains, particularly of El and E2/NS1, may also 
be enhanced by preparing them in eukaryotic systems fused 
with or assembled with particle- forming proteins such as, 

25 for example, that associated with hepatitis B surface 
antigen. See, e.g., U.S. Patent No. 4,722,840. 
Constructs wherein the polypeptide containing the HCV 
epitope from a variable domain is linked directly to the 
particle- forming protein coding sequences produces 

30 hybrids which are immunogenic with respect to the HCV 
epitope. In addition, all of the vectors prepared 
include epitopes specific to HBV, having various degrees 
of immunogenicity, such as, for example, the pre-S 
peptide. Thus, particles constructed from particle 
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forming protein which include HCV sequences are 
immunogenic with respect to HCV and HBV. 

Hepatitis surface antigen (HBSAg) has been 
shown to be formed and assembled into particles in S. 
5 cerevisiae (Valenzuela et al . (1982), Nature 298 : 344 . as 
well as in, for example, mammalian cells (Valenzuela et 
al. (1984), in "Hepatitis B", Millman I. et al . , ed.). 
The formation of such particles has been shown to enhance 
the immunogenicity of the monomer subunit. The 

10 constructs may also include the immunodominant epitope of 
HBSAg, comprising the 55 amino acids of the presurface 
(pre-S) region. Neurath et al • (1984). Constructs of 
the pre-S-HBSAg particle expressible in yeast are 
disclosed in E.P.O. Publication No. 174,444; hybrids 

15 including heterologous viral sequences for yeast 

expression are disclosed in E.P.O. Publication No. 
175,261. These constructs may also be expressed in 
mammalian cells such as CHO cells using an SV40- 
dihydrof olate reductase vector (Michelle et al. (19 84)). 

20 In addition, portions of the particle- forming 

protein coding sequence may be replaced with codons 
encoding an epitope from an HCV variable domain. In this 
replacement, regions which are not required to mediate 
the aggregation of the units to form immunogenic 

25 particles in yeast or mamnals can be deleted, thus 
eliminating additional HBV antigenic sites from 
competition with the HCV epitope (s). 

The preparation of vaccines which contain an 
immunogenic polypeptide (s) as an active ingredient (s) is 

30 known to one skilled in the art. Typically, such 

vaccines are prepared as injectables, either as liquid 
solutions or suspensions; solid forms suitable for 
solution in, or suspension in, liquid prior to injection 
may also be prepared. the preparation may also be 

35 emulsified, or the polypeptide (s) encapsulated in 
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liposomes. The active immunogenic ingredients are often 
mixed with excipients which are pharmaceutically 
acceptable and compatible with the active ingredient. 
Suitable excipients are, for example, water, saline, 
5 dextrose, glycerol, ethanol, or the like and combinations 
thereof. In addition, if desired, the vaccine may 
contain minor amounts of auxiliary substances such as 
wetting or emulsifying agents, pH buffering agents, 
and/or adjuvants which enhance the effectiveness of the 

10 vaccine. Examples of adjuvants which may be effective 
include, but are not limited to: aluminum hydroxide, N- 
acetyl-muramyl-L- threonyl-D-isoglutamine (thr-MDP) , N- 
acetyl-nor-muramyl-L-alanyl-D-isoglutamine (CGP 11637) , 
referred to as nor-MDP) , N-acetylmuramyl -L-alahyl -D- 

15 isoglutaminyl-L- alanine- 2- (1' -2' -dipalmitoyl - sn-glycero- 
3 -hydroxyphosphoryloxy) - ethylamine (CGP 19835A, referred 
to as MTP-PE, and RIBI, which contains three components 
extracted from bacteria, monophosphoryl lipid A, 
trehalose dimycolate and cell wall skeleton (MPL+TDM+CWS) 

20 in a 2* squalene/Tween 80 emulsion. The effectiveness of 
an adjuvant may be determined by measuring the amount of 
antibodies directed against an immunogenic polypeptide 
containing an HCV epitope from a variable domain, the 
antibodies resulting from administration of this 

25 polypeptide in vaccines which are also comprised of the 
various ad j uvants . 

The proteins may be formulated into the vaccine 
as neutral or salt forms. Pharmaceutically acceptable 
salts include the acid addition salts (formed with free 

30 amino groups of the peptide) and which are formed with 
inorganic acids such as, for example, hydrochloric or 
phosphoric acids, or organic acids such as acetic, 
oxalic, tartaric, maleic, and the like. Salts formed 
with the free carboxyl groups may also be derived from 

35 inorganic bases such as, for example, sodium, potassium, 
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arnmonium, calcixim, or ferric hydroxides, and such organic 
bases as isopropylamine, trimethylamine , 2-ethylamino 
ethanol, histidine, procaine, and the like. 

The vaccines are conventionally administered 
5 parenterally , by injection, for example, either 
subcutaneous ly or intramuscularly. Additional 
formulations which are suitable for other modes of 
administration include suppositories and, in some cases, 
oral formulations. For suppositories, traditional 

10 binders and carriers may include, for example, 

polyalkylene glycols or triglycerides ; such suppositories 
may be formed from mixtures containing the active 
ingredient in the range of 0.5% to 10%, preferably l%-2%. 
Oral formulations include such normally employed 

15 excipients as, for example, pharmaceutical grades of 
mannitol, lactose, starch, magnesium stearate, sodium 
saccharine, cellulose, magnesium carbonate, and the like. 
These compositions take the form of solutions, 
suspensions, tablets, pills, capsules, sustained release 

20 formulations or powders and contain 10%-95% of active 
ingredient, preferably 25% -70%. 

In addition to the above, it is also possible • 
to prepare live vaccines of attenuated microorganisms 
which express recombinant polypeptides of the HCV antigen 

25 sets. Suitable attenuated microorganisms are known in 

the art and include, for example, viruses (e.g., vaccinia 
virus) as well as bacteria. 

The vaccines are administered in a manner 
compatible with the dosage formulation, and in such 

30 amount as will be prophylactically and/or therapeutically 
effective. The quantity to be administered, which is 
generally in the range of 5 ^lg to 250 fig of antigen per 
dose, depends on the subject to be treated, capacity of 
the subject's immune system to synthesize antibodies, and 

35 the degree of protection desired. Precise amounts of 
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active ingredient required to be administered may depend 
on the judgment of the practitioner and may be peculiar 
to each individual . 

The vaccine may be given in a single dose 
5 schedule, or preferably in a multiple dose schedule. A 
multiple dose schedule is one in which a primary course 
of vaccination may be with 1-10 separate doses, followed 
by other doses given at subsequent time intervals 
required to maintain and/or reenforce the immune 
10 response, for example, at 1-4 months for a second dose, 

and if needed, a subsequent dose(s) after several months. 
The dosage regimen will also, at lest in part, be 
determined by the need of the individual and be dependent 
upon the judgment of the practitioner. 
15 In addition, the vaccine containing the antigen 

sets comprised of HCV polypeptides described above, may 
be administered in conjunction with other 
immunoregulatory agents, for example, immune globulins. 

The compositions of the present invention can 
-20 . be. administered to individuals to generate polyclonal 
antibodies (purified or isolated from serum using 
conventional techniques) which can then be used in a 
number of applications. For example, the polyclonal 
antibodies can be used to passively immunize an 
25 individual, or as immunochemical reagents. 

In another embodiment of the invention, the - 
above -described immunoreactive compositions comprised of 
a plurality of HCV antigen sets are used to detect 
anti-HCV antibodies within biological samples, including 
30 for example, blood or serum samples. Design of the 

immunoassays is subject to a great deal of variation, and 
a variety of these are known in the art. However, the 
immunoassay will use antigen sets wherein each antigen 
set consists of a plurality of substantially identical 
3 5 polypeptides comprising the amino acid sequence of an 
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epitope within a first variable domain of an HCV isolate, 
and the amino acid sequence of one set is heterogeneous 
with respect to the amino acid sequence of at least one 
other set. Protocols for the immunoassay may be based, 
5 for example, upon competition, or direct reaction, or 
sandwich type assays. Protocols may also, for example, 
use solid supports, or may be by immunoprecipitation . 
Most assays involve the use of labeled antibody or 
polypeptide; the labels may be. for example, fluorescent. 
10 chemiluminescent, radioactive, or dye molecules. Assays 
which amplify the signals from the probe are also known; 
examples of which are assays which utilize biotin and 
avidin, and enzyme- labeled and mediated immunoassays, 

such as ELISA assays. 
15 Kits suitable for iirrnunodiagnosis and contain- 

ing the appropriate labeled reagents are constructed by 
packaging the appropriate materials, including the 
compositions of the invention containing HCV epitopes 
from variable domains, in suitable containers, along with 
the remaining reagents and materials (for example, 
suitable buffers, salt solutions, etc) required for the 
conduct of the assay, as well as a suitable set of assay 

instructions . 

Described below are examples of the present 
25 invention which are provided only for illustrative 
purposes, and not to limit the scope of the present 
invention. In light of the present disclosure, numerous 
embodiments within the scope of the claims will be appar- 
ent to those of ordinary skill in the art. 
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Examples 

In the Examples the following materials and 
methods were used. 

Patient Samples and RNA Exfr?ict inn 

Asymptomatic HCV carriers HCT 18 and HCV Ji and 
chronically infected HCV patient Th have been previously 
described in Weiner et al . (1991) Virol . 180:842-848. 
Patient Q was diagnosed with chronic active hepatitis 
based on a liver biopsy and was placed on alfa-2b 
interferon therapy (3 million units, thrice weekly) for 
six months. RNA from 0.2 ml of plasma was extracted 
according to the method of Chomcyhski and Sacchi, (19 87) 
Anal. Biochem. 162:156-159, using RNAzol™ B reagent 
(Cinna/Biotecx Laboratories) containing lO ;ig/ml MS2 
carrier RNA (Boehringer Mannheim, 165-948) as indicated 
by the manufacturer. RNA was resuspended in 200 fxl of 
diethyl pyrocarbonate treated distilled water and 
reprecipitated in a final concentration of 0.2M sodium 
acetate and two and one half volumes of 100% ethanol 
20 (-20<»C) . 

> 

cDNA an d Polymerase Chain Reactions 

All reactions were performed according to 
Weiner et al. (199 0) Lancet 115:1-5. M13 sequencing was 
25 performed according to Messing et al . (1983), Methods in 
Enzymology 101:20-37. The consensus sequence of at least 
four cloned inserts are presented with the exception of 
the HCV J1.2 E2/NS1 sequence which was derived from two 
clones . 

30 Cloning and sequencing of HCT 18 and Th was as 

reported in Weiner et al . (1991), supra. Nested PCR 
primers used to clone the amino terminal and carboxy 
proximal segments of E2/NS1 in patient Q were: 
PCR I 

35 X(E2)14 GGTGCTCACTGGGGAGTCCT( 13 67-1386)5 
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X{E2) 18J CATTGCAGTTCAGGGCCGTGCTA(1608-1588) A, 

PGR II 

X(E2)4 TCCATGGTGGGGAACTGGGC{ 1406-1425)3 
X(E2)19J TGCCAACTGCCATTGGTGTT{1582-1562) A; 

5 PGR I 

X(E2)14 (above) S 

Jlrcl2 TAACGGGCTGAGCTCGGA(2313-2296) A 

PGR II 

US ( E2 ) 5 CAATTGGTTCGGTTGTACG (19 60-1978)3 
10 Jlrcl3 GGTCCAGTTGGAGGCAGCTTG (2260-224 0) A. 

PGR primers used to clone the HGV Jl E2/N31 gene were: 
PGR I 

J1(E2)14 (above) 3 

Jl(E2)rc30" GAGGGGAGTATGTGGGACTG (2349-2330) A 
15 JlIZ - 2' TGAGAGGGAGGTGGTGGTGGT (1960-1978)3 

Jl(E2)rc32** TTTGATGTACCAGGGGGCGGA ( 2 6 5 8 - 2 6 3 6 ) A 
PGR II-E2384.5' 

GGATGGGGTAGCGATAGGGGCGTGAGGGGGGGGGTGCAA ( 1469 - 
1495)3 

20 DSGONIJBX* 

GGATGGTGTAGATTAGTGTTCTGAGGTATGGGTGTGCTGGAAGTC 

AGA(2272-2301) A 

JlIZ-l' GAAGTGGTTGGGGTGTAGA(1915-1935)S 
Jl(E2)rc31** (2566-2546) A. 
25 *, nt sequence from Takeuchi et al., (1990) Nucl. -Acids 
Res. 18:4626; nt sequence from Kato et al . , (1989) 
Proc. Jpn. Acad. £5B:219-223. Sense (S) or antisense (A) 
PGR primers are given in the 5' to 3' orientation 
according nucleotide numbers in reference. 

30 

Synthesis of Biotinylated Peptides 
The overlapping octapeptides for the 
hypervariable regions of three strains of HGV were 
synthesized on cleavable- linker , derivatized, 

35 
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polyethylene pins essentially as described by (Maeji ec 
al., (1990) J. Immunol. Methods 134:23-33, was coupled to 
the N- terminus of each peptide. Finally, biotin was 
coupled to the N- terminus using 150 ^tl of a 
5 dimethylf ormamide solution containing 40 mM biotin, 40 mM 
1 -hydroxybenzotriazole (HOBt) , 40 mM 
benzotriazole- 1 - yl -oxy- tris -pyrrlidino-phosphonium 
hexaf luorophosphate (PyBOP, NOVABIOCHEM) and 60 mM 
N-methylmorpholine (NMM) reacting overnight at 20 'C. 
10 After biotinylation, the peptides were 

side- chain deprotected, washed and the peptide from each 
pin was cleaved in 200 ^1 of 0 . IM phosphate buffer (pH 
7.2). Microtitre plates containing the cleaved peptide 
solutions were stored at -20 'C until needed. 



F9 6) were coated with streptavidin by incubating 
overnight at 4*=*C with 0.1 ml/well of a 5 ^ig/ml solution 

20 of streptavidin (Sigma Cat. No. S4762) in 0,1 M carbonate 
buffer at pH 9.6. After removal of the streptavidin 
solution, the wells were washed four times with a 0.1% 
solution of Tween 20 in PBS. Nonspecific binding was 
blocked by incubating each well with 0.2 ml of 2% BSA in 

25 PBS for 1 h at 20°C. The wells were again washed four 
times with PBS/Tween 20. Plates were air-dried and 
stored at ^^C until required. The streptavidin in each 
well was coupled to cleaved peptides by incubation with 
100 ^1 of a 1:100 dilution of cleaved peptide solution 

30 with 0.1% BSA in PBS containing 0.1% sodium azide for 1 h 
at 20®C. After incubation, the plate was washed four 
times with PBS/Tween 20. Each well was incubated with 
100 /il of a suitable dilution of ser\am (diluted with 2% 
BSA in PBS containing 0.1% sodium azide) for 1 h at 20 °C 

35 or overnight at 4*='C followed by four washes with 



' ELISA Testing of Biotinylated Peptides 
Polystyrene plates (Nunc immuno plate maxisorb 
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PBS/Tween 20. Bound antibody was detected by reaction 
for 1 h at 20 °C in 0.1 ml conjugate. This consisted of 
0.25 ml/1 (a saturating level) of horseradish peroxidase- 
labeled goat anti- rabbit IgG (H+L) (Kirkegaard and Perry 
5 Labs, Gaithersburg, MD) in CASS (0.1% sheep serum, 0.1% 
Tween 20, 0.1% sodium caseinate diluted in 0 . IM PBS, pH 
7.2) . The wells were washed 2 times with PBS/Tween 20 
followed by two washes with PBS only. The presence of 
enzyme was detected by reaction for 45 min at 20 *C with 

10 0.1ml of a freshly-prepared solution containing 50 mg of 
ammonium 2,2' -azino-bis [3 - ethylbenzothiazoline- 
6-sulphonate (ABTS, Boehringer Mannheim Cat. no. 122661) 
and 0-03 ml of 35% (w/w) hydrogen peroxide solution in 
100 ml of 0.1 M phosphate/0.08 M citrate buffer, pH 4.0. 

15 Color development was measured in a Titertek Multiscan MC 
plate reader in the dual wavelength mode at 405 nm 
against a reference wavelength of 492 nm. 

Computer Generated Antigenicity Profile 

20 Antigenicity profiles for the HCV E2/NS1 . 

protein and HIV-l gpl20 hypervariable region V3 (aa 303- 
33 8) were derived from a computer program based on the 
degree of sequence variability as originally proposed by 
Kabat [Sequences of proteins of immunological interest. 

25 U.S. Department of Health and Human Services, Public 

Health Service, National Institutes of Health (1983)] for 
the identification of the hypervariable loops of 
immunoglobulins multiplied by the average of the 
individual probability that antibody binding is retained 

30 for each possible pair-wise amino acid. Probabilities 

for retention of antibody binding associated with a given 
amino acid change were the values experimentally 
determined by assessing the effects on antibody binding 
of all possible amino acid substitutions for 103 

35 characterized linear epitopes. Geysen et al . , (1988) J. 
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Mol, Rec. 1:32-41. This algorithm thus weights the 
variability index to give more significance to amino acid 
changes likely to have a significant effect on antibody 
binding, i.e., compensates for conservative amino acid 
5 changes. Fifteen HCV sequences [HCV-l, Q3.2, HCT 23, 

EClO, HC-Jl, HCVEl, TH, HCT 27, Ql . 2 , HCT18, HC-J4, HCV 
J1.2/HCV Jl.l, HCV J , HCV BK] , were used to determine 
the antigenicity profile for HCV. The HIV-l V3 profile 
was obtained by averaging 242 individual profiles of 15 
10 sequences selected at random from the numerically greater 
data base of unique HIV-i sequences. LaRosa et al . , 
(1990) Science 249:932-935 & Correction in Science (1991) 
p. 811. The amino acid sequences of some of these 
isolates between aa 384 and 420 are shown in Figure 3. 

15 

Computer Generated Secondary Structure 
Predictions 

The Qf-helix, )3-sheet and jS-turn secondary 
structure probabilities for the amino- terminal region 

20 (384-420) were determined using an algorithm, which 

assigns the probabilities for each of the three above 
secondary structural motifs to each residue.; The 
coefficients used in the algorithm were obtained for all 
pair-wise combinations of residues of the structural data 

25 base. Levitt and Greer, (1977) J, Mol . Biol. 

114:181-293. The prediction parameters obtained from 
these coefficients were fitted to the observed outcome 
when the algorithm was applied back on the database to 
obtain probabilities that a given residue would be found 

30 in one of the three defined secondary structural motifs. 



35 
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Example 1 

Comparison 'of Secondary Structure and Amino 
Acid Sequence Variation in the HCV E2/NS1 HV 
5 and HIV-l apl20 Domains 

The amino acid sequences from fifteen HCV and 
HIV-l isolates were compared with respect to the number 
of positions at which amino acid sequence heterogeneities 

10 were observed in the HCV E2 HV or HIV-l gpl2 0 V3 domains 
(Figure 4, A and respectively). Amino acid 
heterogeneities occurred in 25 of 30 amino acid positions 
in the E2 HV region and 23 of 35 amino acid positions in 
the HIV-l gpl2 0 V3 domain. Dashes on the x-axis of 

15 Figure 4 A and B represent amino acid positions where 
variable amino acid residues occur and invariant amino 
acids are given in the single letter amino acid code. 
The antigenicity profiles shown in Figure 4 indicate 
that, similar to the V3 loop of the HIV-l gpl20 protein 

20 (Figure 4B) , a block of amino acid residues in the HCV E2 
(amino acids 384-414 in Figure 4A) was identified whose 
variation had a predicted adverse affect on antibody . ' 
binding. The data in figure 4 indicate that the HCV E2 
domain resembles the HIV-l gpl20 V3 domain, which is 

25 known to encode virus neutralizing epitopes, in both the 
degree and predicted significance of observed amino acid 
variation and suggests that the E2 HV domain may have a 
similar function as the gpl20 V3 domain. 

Linear epitopes are more likely associated with 

30 less structured regions of proteins, in particular, the 
ends of proteins or with extended surface loops. A 
computer analysis was used to predict the probability 
that an individual residue is associated with a defined 
secondary structural motif for 15 E2 HV amino acid 

35 sequences between residues 384 to 420. Figure 4 shows 
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that the region between the E2 amino- terminal residue 3 84 
and the strongly predicted, highly conserved beta- turn 
(residues 415-418) is relatively unstructured as 
indicated by less than 5 0 percent probability of 
5 alpha-helix, beta-sheet or beta-turn character. Lack of 
strongly predictive structure in the E2 HV domain is 
consistent with the tolerance for extensive sequence 
variation found between isolates and is in contrast with 
highly structured regions which contribute to tertiary 

1-0 folding of the protein. The HCV E2 HV domain appears to 
be even less structured than the V3 , principal 
neutralizing domain of HIV-l gpl20, which has been 
reported to contain a beta strand- type II beta turn-beta 
strand- alpha helix motif and may have greater structural 

15 constraints on amino acid variability than the HCV E2 HV 
domain. Taken together, the evidence suggests that the 
E2 HV domain appears to have features characteristic of 
protein domains which contain likely sites of linear 
neutralizing epitopes, 

20 . 

Example 2 

Epitope Mappin g of the HCV E2/NS1 HV Domain 

Overlapping biotinylated 8-mer peptides 
corresponding to and extending past the E2/NS1 HV domain 
^ (amino acids 384 to 416) of HCT 18 (A,D) , Th (B,E-) and 
HCV Jl (C,F) were bound to plates coated with 
streptavidin and reacted with plasma from either HCT 18 
(A-C) or Th (D-F) . The results are shown in Figure 6 for 
HCV isolates HCT 18 (Fig. 6A and 6D) , Th {Fig. 6B and 
^° 6E) , and HCV Jl (Fig. 6C and 6F) , HCT 18 plasma was 

diluted 1:200 and Th plasma was diluted 1:500. HVE-1, 
-2, -3, -4 and -5, represent isolate specific epitopes. 

As seen from Figure 6, HCT 18 plasma identi- 
fied a linear epitope C^^PKQNV^^M when tested with 

35 
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peptides derived from the HCTIS sequence (HVE-I in 
Figure 6A) , but failed to react with peptides corres- 
ponding to the HV domain of two different strains Th and 
HCV Jl (Figures 6B and 6C) . In contrast, Th plasma 
5 identified linear epitope HVE-IV in the HV domain of Th 
(^QNIQLI^**, Figure 6E) , and also epitopes in strain HCT 
18 C^XVRFFAP^^, Figure 6D) and HCV Jl • Th, an IV drug 
user, may have been exposed to multiple strains of HCV. 

Both Th and HCT 18 plasma each reacted with an 

10 epitope (amino acids 413-419) common to all three . 

isolates (data not shown) when used in an ELISA with pin 
synthesized overlapping 8mer peptides from each isolate . 

In-order to validate antibody binding 
specificity, antibodies bound to biotinylated peptides 

15 containing amino acids 403-407 were eluated and used to 
block the reactivity of HCT 18 plasma with pins 
containing overlapping, 8 -mers for the HCT 18 HV domain. 
These data indicate that 1) the E2/NS1 HV domain is 
immunogenic, 2) there are multiple epitopes which map to 

20 this region, and 3) a subset of epitopes (HVE-1, -2, -3, 
-4 or -5 in Figure 6) in the HV domain are isolate 
specific. 

Example. 3 

25 Determination that Variant E2/NS1 HV Domains 

Can Be Associated With Flares of Hepatitis 

To investigate the possibility of finding HCV 
variants associated with the intermittent flares of 
hepatitis often found in chronic HCV infections, we 
partially sequenced the E2/NS1 gene from a patient, Q, 
with chronic hepatitis during two distinct episodes of 
hepatitis approximately two years apart (Ql and Q3 , 
respectively) . The second episode of hepatitis occurred 
1.5 years after the termination of interferon treatment. 

35 
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The differences in the deduced amino acid 
sequence of the Ql and Q3 E2/NS1 HV region was strikingly 
different only between amino acids 391-408 with seven of 
eight changes occurring between amino acid 39 8 and 407 
5 (Figure 7) . Figure 7 shows the deduced amino acid 

sequences of two regions of the E2/NS1 polypeptide, amino 
acids 384-414 and 547-647, for the Ql and Q3 isolates. 
The amino acid (E) above the Qi sequence was found in one 
of four Ql clones. The boxed amino acids represent the 
10 location of the Ql or Q3 HVE I2mer peptide. Amino acid 
sequence differences found between Ql and Q3 are printed 
in bold type. 

Only one amino acid heterogeneity was observed 
between amino acids 547 and 647 of the Ql and Q3 E2/NS1 

15 polypeptides (Figure 7) . 

To examine the effect of the amino acid 
substitutions observed in the Ql and Q3 E2 HV domains on 
antibody binding, we synthesized a Ql and Q3 specific 
12-mer peptide from amino acids 396 to 407 (HVE Ql or Q3 

20 in Figure 7B) and separately reacted the Ql and Q3 plasma 
with each peptide in an ELISA. Table 4 shows that 
antibodies in both the Ql and Q3 plasma reacted with the 
Ql peptide but not with the Q3 peptide. Statistical 
analysis (Student's Test) indicated that the binding of 

2S the Q1/Q3 plasma to the Ql peptide was significantly 

above background binding of those plasma to a panel of 12 
randomly chosen control peptides (P<0.001), while binding 
of either the Ql or Q3 plasma to the Q3 peptide was not 
statistically significant. The data indicate that 

3 0 although patient Q developed antibodies to the HCV Ql HV 
domain, which were still detectable two years later at 
the Q3 time point, no detectable humoral response had 
developed to the Q3 E2 HV variant which was predominant 
during the second episode of hepatitis. 
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Table 4 

PI -i t^a Results on 12-mer Peptides 

TARFAGFFQSGA TAGFVRLFETGP 
Plasma Ql seq Q3 seq 

Mean sd Mean sd 

Ql 1.158 0.134 0.691 0.123 

Q3 1.022 0.123 0.693 0.036 



Example 4 

nf>t-. faction of Coexisting E2/NS1 Ge nes With 
n-igrinct E2/NS1 HV Domains in HCV Infected 
TndividualS 

Figure 8A shows the amino acid sequences 
deduced from two isolates of HCV Jl (Jl.l & J1.2) which 
were cloned from one plasma sample of the Japanese 
volunteer blood donor HCV Jl. Kubo at al . , (1989) Nucl . 
Acids Res. 17:10367-10372. Of the 23 total amino acid 
changes between HCV Jl.l and HCV J1.2, 9 differences 
indicated by bold type are clustered in the 30 amino acid 
E2/NS1 HV domain. Five of the 9 amino acid substitutions 
in the E2/NS1 HV domain represent nonconseryative amino 
acid changes. Since HCV Jl is the only group II HCV 
genome which has been cloned in our laboratory, it is 
unlikely that these differences are due to cross contami- 
nation of the HCVJl plasma. The HCV J1.2 sequence 
represents a minority sequence in HCV Jl's blood since 
only two E2/NS1 HV variant sequences were identified from 
7 cloned sequences which originated from two independent 

PCR reactions . 

Interestingly, a comparison of the HCT27 and 
HCV El isolates (Figure 8B) , which were sequenced in 
different laboratories and derive from presumably 
unrelated individuals, showed that the nxamber of amino 
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acid differences in the E2/NS1 HV domain of these 
isolates were fewer than the number of differences 
observed between isolates from the same individual. 

The above described results lead to the 
5 suggestion that the HCV genome is rapidly evolving in 
individuals and the population. 

Example ^ 

Formulation and Preparation nf Vaccina 

10 

Coupling of the Diphtheria To x oid C;^^^ier P^nr^in to MC5 
Materials Required 

ethylene diamine tetra-acetic acid (EDTA Na2.2H20) (MW 

6-maleimido-caproic acid N-hydroxysuccinimide ester (MCS) 
15 (Sigma) - 95% pure 

sodium dihydrogen orthophosphate (NaH2P04) 
nitrogen 

dimethyl formamide (DMF) 
Milli Q water 

0.1 M phosphate buffer containing 5 mM EDTA, pH 6 66 
0.1 M phosphate buffer, pH 8 . 0 
2Q 0.1 M phosphate buffer, pH 7.0 

sodium succinate t {CH2C00Na) j. eH^O] 
cysteine 

hydrochloric acid (2% solution) 

0.1 M sodium succinate/0.1 EDTA, pH 5.6 



25 



30 



Purified diphtheria toxoid (Commonwealth Serum 
Laboratories, Victoria, Australia) was coupled to MCS 
according to the method described by Lee et al . , (19 80) 
Mol. Immunol. 17:749; Partis et al . , (1983) Prot . Chem. 
2:263; Peeters et al . , (1989) J. Immunol. Methodg 
120:133; Jones et al . , (1989) J. Immunol. Methndg 
123:211. 100 ml of diphtheria toxoid was passed through 
a G25 Sephadex column (I7cm X 4 cm) to remove thiomersal. 
The toxoid was eluted with 0.1 M phosphate buffer pH 7.0 
and the protein content of the eluate was assayed using 
the BCA protein determination (Pierce) . The resulting 
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solution was concentrated using an Amicon ultrafiltration 
unit to a final concentration of 10 mg/ml . 

One milliliter of the toxoid solution was 
dialyzed with 0.1 M phosphate buffer, pH 8.0, and then 
5 mixed with a solution of 1.5 mg MCS in 2 00 /xl DMF. The 
resulting solution was incubated at room temperature for 
1 hour in the dark with occasional mixing. In order to 
separate the uncoupled MCS from the MCS -toxoid, the 
solution was passed through a Sephadex PDlO column which 

10 had been equilibrated with 0.1 M phosphate buffer, pH 
6.66 and" the protein fraction was collected. 

The number of maleimido groups coupled per 
carrier molecule was determined prior to coupling of the 
HCV peptides thereto. Thirty milliliters of the 

15 succinate/EDTA buffer was sparged with nitrogen for 2 
minutes. Five milligrams of cysteine was transferred 
into a 25 ml volumetric flask and dissolved in a final 
volume of 25 ml of the sparged buffer. Aliquots of the 
solutions shown in Table 5 were transferred in duplicate 

20 to 25 ml screw capped bottles. Using separate pipettes, 
nitrogen was bubbled into each aliquot. Each bottle was 
then sealed and incubated at room temperature in the dark 
for 40 minutes with occasional swirling. 

Table 5 

25 Solution Sample (ml) Standard (ml) Blank (ml) 

activated carrier- 0.3 

phosphate buffer - 0.3 0.3 

cysteine solution 1.0 1.0 

succinate buffer - - 1.0 

* A 0.1 ml aliquot of each of the 3 solution was taken 
30 for an Ellman's determination. 
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Ellman's Test for the Quant i tat ive Determinatiion of 

Sulf hvdrvl 

Materials Required 

Phosphate buffer, pH 8 . 0 
g Dissolve 15.6 g NaH2P04 or 12,0 g 

NaH2P'04 anhydrous in approximately 700 
ml Milli Q water. Adjust the pH to 
8.0 using 50% NaOH. Add Milli Q water 
for a final volume of 1000 ml and 
then adjust the pH if necessary. 
Ellman's Reagent 

Dissolve 10.0 mg of 5 , 5 ' -dithiobis - 2 - 
10 nitrobenzoic acid (DTNB) in 2,5 ml of phosphate 

buffer, pH 8.0 

0.1 ml of Ellman's reagent was added to each of 
the 0 . 1 ml aliquots of the solutions prepared above, 
namely the sample, standard and bland solutions. Five 

^5 milliliters of phosphate buffer, pH 8.0, was then added 
to each aliquot, mixed well and allowed to stand for 15 
.minutes. The absorbance of each aliquot was measured in a 
1 cm path length cell at 412 nm. 

The number of maleimido groups present on the 

2 0 carrier protein was determined according to the following 
method. A 0.01 /xmol per ml solution of -SH produces an 
absorbance of 0.136 in a 1 cm light path at 412 nm. The 
absorbance of the Standard or Sample (A) is equal to the 
amount of cysteine reacted with the coupled maleimido 

25 groups on the activated carrier protein. Since 1 mol of 
available -SH reacts with 1 mol of maleimido, the 
concentration in /xmols of the maleimido groups present in 
the aliquot tested is equal to A{0 . 01) /0 . 136 ^ol/ml. 
The total volume of the solution was 5.2 ml. Therefore, 

2Q the total number of /xmols present was equal to 

A(0,01) (5.2) /0. 136. The sample solution had a total 
volume of 1.3 ml, of which 0.3 ml consisted of the 
activated carrier protein. The amount of maleimido 
groups present in the sample solution was calculated as 

35 A(0. 01) (5.2) (1.3) / (0.136) (0.1) (0.3) =A(16.57) ^mol/ml. 
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The MCS -activated carrier protein was , stored at -20° C. 

Reduction of the HCV Peptides 

Prior to coupling of the HCV peptides to the 

5 MCS- activated carrier protein, the peptides were reduced 

to ensure that thiol groups present on the peptides were 

in the fully reduced -SH form. 

Materials Required 

dithiothreitol (DTT) 
-j^Q ammonium hydrogen carbonate {NH4HCO3) 
methanol 

SEP-PAKs (CIS cartridge, Waters) , 1 cartridge for each 8 

mg of peptide 
0.1 M ammonium hydrogen carbonate buffer 

Dissolve 7,9 g NH4HCO3 in l L Milli Q 

water 

Buffer A, 0.1% v/v trif luoroacetic acid (TFA) in Milli Q 
15 , water 

Buffer B, ^'60%. v/v acetonitrile , 0.1% v/v TFA in Milli Q 
water 

15 mg of each of two HCV peptides corresponding 
to amino acids 384-411 and 225-260, respectively, of the 
HCV polyprotein were added to 2.5 ml of 0,1 M ammoniijm 
hydrogen carbonate containing a 10 fold molar excess of 
DTT. The resulting solutions were mixed until the 
peptide had dissolved and were then allowed to stand for 
1 hour at room temperature. Two pairs of SEP-PAKs were 
connected in series and activated by passing 
approximately 20 ml of methanol and then 20 ml of Buffer 
A through each pair of SEP-PAKs, Each peptide/DTT sample 
was slowly passed through a pair of SEP-PAKs. The DTT 
was eluted with 20 ml of Buffer A. The reduced peptide 
was eluted with 7 ml of Buffer B into a pre -weighed 
bottle and then freeze-dried overnight. The bottles were 
then weighed to determined the amount of recovered 
peptide. The reduced peptides were then immediately 
coupled to the MCS -activated carrier protein. 



20 



25 



30 



35 



wo 93/06126 



-58- 



PCT/LS92/07683 



Couplincr HCV Peptides to MCS- Activated Carrier Protiein 

Approximately 100 ml of 0 . 1 M phosphate buffer 
with 5 mM EDTA, pH 6.66 was degassed under vacuum and 
then sparged with nitrogen for 10 minutes. Twenty 
5 milliliters of a 10 mg/ml solution of the MCS-activated 
carrier protein was carefully sparged with nitrogen to 
prevent excessive frothing. 5 mg of each of the reduced 
peptides were dissolved in approximately 0.2 ml of the 
degassed sparged phosphate/EDTA buffer, pH 6.66 and then 

10 mixed with the MCS-activated carrier protein solution. 

The resulting mixture was transferred into a screw capped 
bottle which was then filled with nitrogen and sealed. 
The solution was further degassed by holding the bottle 
in a Branson 2000® sonication bath for 2 minutes. The 

15 bottle was covered with aluminum foil and incubated 
overnight at room temperature with slow mixing on a 
shaker table , 

The resultant conjugate was soluble and the 
uncoupled peptide was removed by passing the mixture over 

20 a Sephadex PD 10 column which had been equilibrated with 
the phosphate/EDTA buffer, pH 6.66. The protein fraction 
was collected- The amount of peptide conjugated to the 
carrier protein was determined by amino acid analysis. 

An amino acid analysis of 150 /il aliquots of 

25 both the conjugate aftd the carrier protein was performed. 
The average ratio of the level of amino acids contributed 
solely by the carrier protein was determined to calculate 
the amount of conjugated peptide produced- Levels of 
serine, threonine, tryptophan, methionine, tyrosine and 

30 cysteine were not determined as these amino acids are 
modified under the standard hydrolysis conditions. 
Typical results . obtained in these calculations are 
presented in Table 6. 
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Table 6 



10 



15 



25 



30 



AMINO ACID CARRIER ONLY CONJUGATE 

D 212 193 

E 194 170 

G 153 108 

R 60 56 

A 150 384 

p 79 163 

For the conjugate, the values in bold type are 

the amino acids that were also present in the peptides. 

For conjugates containing alanine and proline, the factor 

(193+179+180+56) / (212) +194+153+60) = 0,8659 is multiplied 

by the amount of the amino acid level in order to 

normalize the result* 



Preparation of Vaccine Composition 

Injectable compositions consisting of HCV 
peptides conjugated to MCS-activated diphtheria toxoid 
carrier protein prepared as described supra and a 
submicron oil-in-water emulsion adjuvant as described in 
PCT International Publication No. WO9014837, published 
December 13, 1990, which is incorporated by reference 
herein. In addition, injectable compositions containing a 
an immunostimulant , lipophilic muramyl peptide (MTP-PE, 
CIBA-GEIGY, Basel, Switzerland) in addition to HCV 
conjugated peptides and adjuvant were prepared. The 
vaccine compositions were generally comprised of 50% 
protein and 50% adjuvant. 



Formula for Vaccine Composition with MTP-PE 

To prepare 10 ml of injectable vaccine composition: 



2 . 5 ml Squalene (Sigma Chemical Co., St. Louis, Mo.) 
0.25 ml Tween 80 (Sigma Chemical Co.) 
0.25 ml, SPAN 85 (Sigma Chemical Co.) 
1000 /xg MTP-PE 

1000 ^JLq HCV peptide conjugated to MCS-activated 
35 diphtheria toxoid carrier protein 
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Formula for Vac cine Composition without MTP-pf 

To prepare 10 ml of injectable vaccine composition: 

t 

2.5 ml Squalene (Sigma Chemical Co., St. Louis, Mo ) 
0.25 ml Tween 80 (Sigma Chemical Co.) 
5 0.25 ml SPAN 85 (Sigma Chemical Co.) 

1000 Mg HCV peptide conjugated to MCS-activated 
diphtheria toxoid carrier protein 



Example fi 

10 Method for Testing Vaccine 

Prepar ations for Toxicity 

Vaccine prepared according to the methodology 
of Example 5 was tested for toxicity in small animals. 
Fifty microgram per kilogram of vaccine was administered 
to guinea pigs, mice and rabbits by intraperitoneal 
injection. The vaccine was also administered by 
intraperitoneal injection to rhesus monkeys and primates. 
Half of the test population of rhesus monkeys and 
primates received 5 /xg/kg doses of the vaccine, while the 
other half received 50 /ig/kg dosages. Control animals 
employed in each of the studies were injected with a 
comparable amount of a composition consisting of the 
components of the vaccine preparation except the viral 
peptides . 

Each of the animals was monitored for symptoms 
indicative of a response to toxic material. More 
specifically, each animal in the study was examined bi- 
weekly for symptoms including fever, lethargy, weight 
loss, changes in eating habits and for lesions, swelling 
or tenderness at the site of injection. Lymph nodes 
proximal to the injection site were also examined for 
swelling and/ or drainage. The animals were monitored on 
a bi-weekly basis for a period of several months. 
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Example 7 

DemQnst:rat:ion of the Production of 
Neutralizinc- Antibody in Vaccinated Animals 

^ Vaccine prepared according to the methodology 

of Example 5 was tested in chimpanzees in order to 
determine the effectiveness of the vaccine in eliciting 
the production of virus neutralizing antibody in 
vaccinated subjects. Chimpanzees were vaccinated with 5 
/xg/kg dosages of vaccine prepared according to the 
methodology of Example 5 over a six-month time period at 
intervals of 0, 1, 3 and 6 months. Control chimpanzees 
were injected with comparable amounts of a composition 
consisting of the components of the vaccine except the 
viral peptides. Two weeks after the last dose of vaccine 
was administered, the test and control chimpanzees were 
each challenged with a 10 CIU50 (Chimpanzee Infectious 
Unit) dose of CDC/910 plasma inoculum. Commencing one 
week following the viral challenge, each of the 
chimpanzees was monitored for viremia on a weekly basis. 

In order to detect viremia, blood samples and 
liver biopsy specimens were collected from control and 
test animals on a weekly basis for several months. 
Tissue collected by liver biopsy was examined 
histologically for signs of necrosis and/or inflammation. 
In addition, hepatocytes from the biopsy material were 
examined by electron microscopy for the presence of 
tubules characteristic of HCV infection. The blood 
samples were also analyzed by the ELISA assay described 
supra for the presence of antibodies to segments of viral 
polypeptides which were not utilized in preparing the 
vaccine. In particular, each of the blood samples was 
screened by ELISA for the presence of antibodies to NS3, 
NS4 and NS5 peptides. The presence of antibodies to 
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these peptides in the serum of a chimpanzee was 
indicative of HCV infection • 

The following method was employed to detect 
viral RNA circulating in plasma or present in liver 
biopsy tissue collected from the chimpanzees. 

cPCR Method to Det ect HCV RNA in I^iver and in Serum 

In the cPCR assay, putative viral RNA in the 
sample is reverse transcribed into cDNA with reverse 
transcriptase; a segment of the resulting cDNA is then 
amplified utilizing a modified version of the PGR 
technique described by Saiki et al. (1986). The primers 
for the cPCR technique are derived from HCV RNA, which 
can be identified by the family of HCV cDNAs provided 
15 herein. Amplified product corresponding to the HCV-RNA 
is detected utilizing a probe derived from the family of 
HCV cDNAs provided herein. 

The cPCR/HCV assay used in these studies was 
performed utilizing the following methods for the 
2 0 preparation of RNA, the reverse transcription of the RNA 
into CDNA, the amplification of specific segments of the- 
cDNA by PGR, and the analysis of the PCR products. 

RNA was extracted from liver utilizing the 
guanidium isothiocyanate method for preparing total RNA 
25 described in Maniatis et al. (1982). 

In order to isolate total RNA from plasma, the 
plasma was diluted five- to ten-fold with TENB (O.l M 
NaCl, 50 mM Tris-HCl, pH 8.0, 1 mM EDTA) and incubated in 
a Proteinase K/SDS solution (0.5% SDS, 1 mg/ml Proteinase 
K, 20 micrograms/ml Poly A carrier) for 60 to 90 minutes 
at 37^C, The samples were extracted once with phenol (pH 
6.5), the resulting organic phase was re-extracted once 
with TENB containing 0.1% SDS, and the aqueous phases of 
both extractions were pooled and extracted twice with an 
35 equal volume of phenol/CHCl^/isoamyl alcohol [1:1(99:1)]. 
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The resulting aqueous phases were extracted with an equal 
volume of ChCl3/isoamyl alcohol (99:1) twice, and ethanol 
precipitated using 0.2 M sodium acetate, pH 6.5, and 2.5 
volumes of 100% ethanol; precipitation was overnight at 
5 -20°C. 

The cDNA used as a template for the PGR re- 
action was prepared utilizing the designated samples for 
preparation of the corresponding cDNAs. Each RNA sample 
(containing either 2 micrograms of heat denatured total 
chimpanzee liver RNA or RNA from 2 microliters of plasma) 
was incubated in a 25 microliter reaction containing i 
micromolar of each primer, 1 millimolar of each 
deoxyribonucleotide triphosphate (dNTP) , 50 millimolar 
Tris-HCL, pH 8.3, 5 millimolar MgCl^ , 5 millimolar 
15 dithiothreitol (DTT), 73 millimolar KCl, 40 units of 
RNase inhibitor (RNASIN) , and 5 units of AMV reverse 
transcriptase. The incubation was for 60 minutes at 
37°C. Following cDNA synthesis, the reactions were 
diluted with 50 microliters of deionized water (DIW) , 
boiled for 10 minutes, and cooled on ice. 

Amplification of a segment of the HCV cDNA was 
performed utilizing two synthetic oligomer 16-mer primers 
whose sequences were derived from HCV cDNA clones 3 6 
(anti-sense) and 37b (sense) . The sequence of the primer 
25 from clone 3 6 was: 

5 ' GCA TGT CAT GAT GTA T 3 ' . 

The sequence of the primer from clone 3 7b was: 

5 ' ACA ATA CGT GTG TCA C 3 ' . 

3 0 The primers were used at a final concentration of i 

micromolar each. in order to amplify the segment of HCV 
CDNA which is flanked by the primers, the cDNA samples 
were incubated with O.i microgram of RNAse A and the PGR 
reactants of the Perkin Elmer Cetus PCR kit (N801-0043 or 

35 N801-0055) according to the manufacturer's instructions. 
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The PGR reaction was performed for either 3 0 cycles or 60 
cycles in a Perkin Elmer Cetus DNA thermal cycler. Each 
cycle consisted of a 1 minute denaturation step at 94^c, 
an annealing step of 2 minutes at 37*^C, and an extension 
5 step of 3 minutes at 7 2^C. However, the extension step 

in the final cycle (30 or 60) was 7 minutes rather than 3 
minutes. After amplification the samples were extracted 
with an equal volume of phenol: chloroform (1:1), 
followed by extraction with an equal volume of 

10 chloroform, and then the samples were precipitated with 
ethanol containing 0.2 M sodium acetate. 

The cPCR products were analyzed as follows. 
The products were subjected to electrophoresis on 1.8% 
alkaline agarose gels according to Murakawa et al. 

15 (1988) , and transferred onto 2ETA® Probe paper (BioRad 
Corp.) by blotting gels overnight in 0.4 M NaOH. The 
blots were neutralized in 2 X SSC (1 X .SSC contains 0.15 
M NaCl, 0.015 M sodium citrate), prehybridized in 0.3 M 
NaCl, 15 mM sodium phosphate buffer, pH 6.8, 15 mM EDTA, 

20 1.0% SDS, 0.5% nonfat milk (Carnation Co.),, and 0.5 mg/ml 

sonicated denatured salmon sperm DNA. The blots to be 

analyzed for HCV cDNA fragments were hybridized to a 
32 

P-labeled probe generated by nick translation of the 
HCV cDNA insert sequence in clone 35, described in 

25 U.S. S.N. 07/456,637. After hybridization, the blots were 
washed in 0.1 X SSC (1 X SSC contains 0.15M NaCl, O.OIM 
Na citrate) at 65*^C, dried, and autoradiographed. The 
expected product size is 58 6 nucleotides in length; 
products which hybridized with the probe and migrated in 

3 0 the gels in this size range were scored as positive for 
viral RNA. 

As a control, cPCR primers designed to amplify 
alpha-1 anti-trypsin mRNA was performed to verify the 
presence of RNA in each sample analyzed. The coding 
35 region of the alpha-1 anti-trypsin gene is described in 
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Rosenberg et al. (1984) . Synthetic oligomer 16-iner prim- 
ers designed to amplify a 3 65 nucleotide fragment of the 
coding region of the alpha-1 antitrypsin gene were 
derived from nucleotides 22-37 (sense) and nucleotides 
5 372-3 87 (antisense) . The PGR products were detected 

using a "^^P nick-translated probe which lies between, and 
not including ; the cDNA/PCR primer sequences. 

Due to the extreme sensitivity of the PGR re-r 
action, all samples were run a minimum of three times. 

10 All false positive signals were eliminated when the fol- 
lowing precautions were taken: 1) eliminating aerosols by 
using screw capped tubes with rubber O-ring seals; 2) 
pipetting with Ranin MIGROMAN® positive displacement 
pipetters with disposable pistons/capillaries; and 3) 

15 selecting the oligonucleotide sequences for the cDNA and 
PGR primers from two non-contiguous cDNA clones. 

Industrial Utility 

The immunoreactive compositions of the 

2 0 invention, have utility in the preparation of materials, 

for example, vaccines, which in turn may be used for the 
treatment of individuals against HGV infections, 
particularly chronic HGV infections. In addition, the 
compositions may be used to prepare materials for the 
25 detection of multiple variants of HGV in biological 

samples. For example, the immunoreactive compositions of 
the present invention can be used to generate polyclonal 
antibody compositions that recognize more than one HGV 
isolate, or as the antigen in an anti-HGV antibody 

3 0 immunoassay. The latter method can be used to screen 

blood products for possible HGV contamination. 
Polyclonal antiserum or antibodies can be used to for 
passive immunization of an individual. 



35 



wo 93/06126 W or-r/t o« 

^ PCT/LS92/07683 

-66- 



Claims 



10 



WHAT 1 9v CLAIMED IS: 

1. An iminunoreactive composition comprising 
polypepti^des wherein the polypeptides comprise the amino 
acid sequeV:e of an epitope within a first variable 
domain of a Vepatitis C virus (HCV) , and at least two 
heterogeneou^amino acid sequences from the first 
variable domai\ of distinct HCV isolates are present in 
the composition, 



15 



20 



2. An Ymmunoreactive composition according to 
claim 1 comprising V pl^j^rtfrTE^-^Q^ antigen sets, wherein 
(a) each antigen set)^nsists of a\r)lurality of 
substantially identic^ polypeptide^comprising the amino 
acid sequence of An epitope within a Virst variable 
domain of an Hcv/isolat^ and (b) the Vmino acid sequence 
of the epitope dp one seAis heterogenebus wjjbh^ respect 
to the amino aciB sequencA of the an^ictfous sequence of 
at least one othdr set. 



25 



3, 



An 



Lmmunoreacmve composimon according to 



claim 1 wherein thia first hetdrogeneous aAino acid 
sequence is from art^CV group \ isolate aim the second 
heterogeneous amino ^cid seq[uenqe is from HCV group II 
isolate. 



4. An immunoreactive composition according to 
3 0 claim 1 wherein the variable domain Vs within the E2/NS1 
protein. 



35 



5. An immunoreactive compo^sition according to 
claim 4 wherein the variable domain is Vncoded from about 
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amino ^cid 3 84 to about amino acid 411 of the HCV 
polyprot^in. 



claim 1 whei 
protein • 



An immunoreactive composition according to 
iin the variable domain is within the El 



10 



7 . Aij immunoreactive composition according to 
claim 6 wherein tKe variable domain is encoded from about 
amino acid 22^ to About amino acid 260 of the HCV 
polyprotein . 



15 



20 



25 



30 



8. An immtmoreactive composition according to 
claim 1 wherein the polypeptides further comprise the 
amino acid sequence of ah epitope within a second 
variable domain of a hep^^titis C virus (HCV) , and at 
least two heterogeneousx'S5mino""a«4:d sequences from the 
second variable domaiyf of c^stinct^CV isolates are 
present in the compbaition. 



9 • An 11 
claim 8 wherein the 
E2/NS1 protein and tl 
the El protein. 



moreactiVe compos 
.rst variable doma 
second variable 



^tion according to 

•A 

n is within the 
is within 



10. An immui'kDreactive cdmpositAon according to 
claim 1 comprising a plurality of po^lypept\des wherein 
each polypeptide has theXformula 

wherein 

R and R' are amino acid sequences of about 
1-2000 amino acids, and are the same or i^ifferent; 

r and r' are 0 or 1, and are tn)p same or 

different; 
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V is an amino acid sequence comprising the 
sequencte of an HCV variable domain, wherein the variable 
domain qpmprises at least one epitope; 

S in an integer > 1/ representing a selected 
variable dpmain; and 

is an integer > 1, representing a selected 
HCV isolate heterogeneous at a given SV with respect to 
at least one o^her isolate having a different value for 
n, and n being Xindependently selected for each x; 

X is an integer > 1; and 
with the proviso Vhat amino acid sequences are present in 
the composition retoreseriting a combination selected from 
the group consisting of (i) IVj and IVj, (ii) IV^ and 2V2, 
and (iii) IV, and 2V; 

11. The imnuinoreactive composition according 
to claim 10 wherein the\poly:pepti^e formula is 

R^-lVj-lVj-RV' 



[tie polypeptide C( 
Ldes of the f 01 



2 0 12. The j^mihunor^ctive com^ 

to claim 10 wherein 
a mixture of polypepi 

R,-lVi-R'^ a: 

R,-1V2-R',. 

25 

13. A methodVfor prepaJ^ing V 
composition for treatmei^ of HCV coonpri 

(a) providing aVi immunogei 
according to claim 1; 

3 0 (b) providing a suitable ex^ 

(c) mixing the immunogenic 
with the excipient of (b) . 



)osition according 

It ion comprises 

Lae 



immunogenic 
ng: 

c composition 

ipient; and 
mposition of (a) 
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14. A method for producing anti-HCV antibodies 
comprlsln^g administering to a mammal an effective amount 
of an immuifloreactive composition according to claim 1, 

15\ A polyclonal antibody composition made 
according to ttjie method of claim 14 



16. ANjnethod of detecting antibodies to HCV 
within a biological sample comprising: 



to m 



10 (a) prov 

containing antibod 

(b) p 
according to c 

(c) 

15 the immunoreadtive composition' 
which allow t 
and 

(d) 

formed between 
2 0 the biological s 



a biological sample suspected of 
tiple strains of HCV; 
mmunoreactive composition 



rovidipg an 
ilAim 1; 

'reacting \he bi^ological sample of (a) with 

of (b) under conditions 
formation \f aAtigen-antibody complexes; 



itecting 



Cgen of 



Le of (b) 



lation of complexes 
and the antibodies of 
my. 



25 



17. A k\t for detecting;\^ntibodies to multiple 
strains of HCV with\n a biological \sample comprising an 
immunoreactive composition according^ to claim 1 packaged 
in a suitable container. 



30 



18. A DNA molecule encodingXa polypeptide 
comprising two heterogeneous amino acidXsequences from 
the same variable domain of distinct HCvXisolates . 

19. A host cell comprising a DN^ molecule 
according to claim 18. 
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20. A hqst cell according to claim 19 wherein 
the DNA molecule coimri^fiiscontrol sequences that are 
capable of causing expression of the polypeptide. 



21. A 
comprising growing 
to claim 2 0 under d 
expression of the p 




a recombinan't prctein 
f hos-t cells according 
provide for the 
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